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SOME INCREASING PARENT-CHILP Pes. 


SIMILARITIES DURING THE 
GROWTH OF CHILDREN 


NANCY BAYLEY 


Institute of Child Welfare, University of California 


Age-changes in the correlation between children’s mental test 


scores and some measure of parental ability have been shown for 
several different populations. Such reports have often been inci- 
dental to other aspects of research on intelligence. It has seemed 
worth while, therefore, to relate some of these findings to relevant 
material, in both mental and physical relationships, in the Berkeley 
Growth Study. 


PARENT-CHILD CORRELATIONS IN THE LITERATURE 


It is only rarely that mental test scores are available for the 
parents of tested children. But there are a few instances in which 
intelligence test scores of parents and their children have been 
compared directly. One of the most extensive and important of 
these was made by Jones and Conrad (8, 11, 12). They correlated 
Army Alpha test scores of parents with Alpha and Stanford- 
Binet scores of their children. In one hundred and forty families, 
in which both parents and children were given the A y Alpha 
test, four hundred and forty-one pairings gave a 
49. The same r of .49 was obtained for five hundred 
in one hundred eighteen families in which the chil ha 
Stanford-Binet test. The r is also .49 for separate 1 » moth 
and father-child comparisons.' 

Among the most recently published parent-child test compari- 
sons are the correlations presented by Skodak and Skeels between 


1 See (13) for a report on age differences in r’s for this sample. In earlier 
studies the authors did not break down the data by comparing children of 
different ages with their parents. e 
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IQ’s of true mothers (tested near the time the children were born) 

and their children who have grown up in foster homes (15). In 

this study the r’s for sixty-eight cases gradually rose from zero 

when the children were two years of age to .38 and .44 on the latest 

tests of the children, given when they averaged thirteen and one- 
phat yesrs 


Zan Ther’s for these two studies are, for several reasons, not directly 


comparable. However, the earlier study gives the general relation- 
ships to be expected when scores are available on both parents, 
and for a large population of children (age three years and older) 
growing up with their own parents. In the later study the compari- 
sons are made repeatedly for the same population, the variable 
factor being the ages of the children at the time of testing. Although 
in this study the r’s are lower, they give evidence of increasing 
mother-child similarity in intellectual capacities as the childréh 
grow older. This is impressive in view of the fact that the children 
were separated from their true mothers in early infancy. Further- 
more, correlations of the children’s IQ’s with educational status 
of foster parents remained close to zero (ranging at the thirteen 
and one-half year tests from .00 to .06). 

In the absence of direct measures of intelligence, some other 
estimate of parental ability has often been used. We will concern 
ourselves here only with some of these studies which report cor- 
relations for children of different ages. In the Berkeley Growth _ 
Study (2, 6), and in the Guidance Study which has been reported 
by Honzik (10), several socio-economic factors have been correlated 
with the children’s mental test scores at successive ages up to eight 


' and ten years. In both studies the parent-child r’s are zero or even 


t first, but increase as the children grow older. Honzik 
the time the Guidance Study children were eight 
ental test scores correlated .36 with mid-parent 
h a socio-economic index, and .55 with a rating 
lligence. By the time the Berkeley Growth Study 
ten years old, their intelligence scores correlated 


around .55 with such factors as socio-economic ratings and educa- 
tion of parents. 


PARENT-CHILD CORRELATION IN THE BERKELEY GROWTH STUDY 


The Sample—In the present report some parent-child compari- 
seys are made on both mental and physical characteristics of the 


TABLE I—Means AND STANDARD DEVIATIONS OF COMPARABLE ADULT 


EDUCATIONAL AND PHYSICAL DATA FOR CHILDREN AND THEIR PARENTS 


Heights (Inches) Mean sD 

Mature* boys 71.63 2.38 
Boys at 18 years 1 » 71.36 2.51 
Parents of boys S T 

Fathers ath zi ; 70.57 2.46 

Mothers f TR po 64.97 50 

Mid-Parentt f tf? 70.37 2.11 
All fathers E Aos 69.69 2.73 
Girls at 18 or mature 21 65.29 2.57 
Parents of girls 

Fathers 20 68.72 2.67 

M>thers 20 63.92 2.33 

Mid-Parentt 20 63.67 1.90 
All mothers 42 64.47 2.48 

3 Weights (Pounds) 
Mature boys 22 157.49 13.09 
Boys at 18 Years 21 154.40 12.64 
All fathers 41 161.71 19.30 
Mature girls 24 129.87 19.45 
Girls at 18 years 20 126.90 16.15 
All mothers 42 132.76 18.74 
Weight/Height? Indicest 
Boys at 21 years 17 218.8 17.5 
Boys at 18 years 21 213.2 12.2 
All fathers 41 233.0 25.1 
Girls at 21 years 20 210.1 28.6 
Girls at 18 years 20 212.6 26.6 
All mothers 42 224.2 27.1 
‘Education (Years of Schooling Completed) 

Boys 23 13.87 3.38. 
Girls 23 14.67 2.05 
Fathers 43 13.66° 3.82 
Mothers 43 12.79 3.12 
Mid-Parent 43 13.22 2.89 


* Heights after closure of all epiphyses in hand and knee, often after 


eighteen years. 


t Boys’ mothers’ heights multiplied by 1.08; girls’ fathers’ heights multi- 


plied by 0.923. 


t Indices computed from metric units. 


3 


e 


© 


e 


2 


» 


4 The Journal of Educational Psychology 


children in the Berkeley Growth Study whose records are now 
complete through twenty-one years. The number of children tested 
at any given age varies from forty to sixty-one. Data are available 
for eighteen years or longer for forty-seven children from forty- 
three families. Table I gives the means and SD’s for both parents 


gom chädren in the characteristics to be considered here. 


J 


In heights and weights this group of parents is probably typical 
of California adults of similar age and racial antecedents. They 
are, for the most part, better educated, and have a generally higher 
socio-economic status, than a cross section of the Berkeley popula- 
tion with which they were compared at the outset of the Growth 
Study (6). 

In comparison with their parents the children seem to be favored, 
both physically and mentally. They are, on the average, about an 
inch taller than their parents, but five pounds lighter. Conse- 
quently, they are slenderer in build, but this difference may be 
due to the fact that they are still younger than most of the parents 
were at the time the parental measurements were obtained. 

Even though in a number of instances the children’s educations 
are not yet completed, we find that they have had more education 
than their parents. This is true of the girls, in particular, but at 
least four of the boys are now in medical school or other graduate 
study. The mean IQ of these children at eighteen years on the 
Wechsler-Bellevue Scale was 122, SD 16 (3). 

Mental—As stated above, we do not have intelligence-test 
scores for the parents. Among the socio-economic variables studied, 
the parents’ education was found to correlate most highly with the 
children’s mental-test scores. 

In Table II and Figure 1 these previously reported relations for 
the Berkeley Growth Study are reviewed and brought up to date. 
For graphical representation the product-moment r’s have been 
transformed to z’s.? Tu this figure, the z transformations are based 
on r’s between the midparent education (average of the years of 
schooling completed by mother and father) and the children’s 
mental test sigma scores at every test age from birth through 
eighteen years. That is, we have compared a single evaluation of 
parental ability with a succession of different scores of their chil- 
dren’s intelligence. It is evident that the parent-child correlation 


3 From the table on page 348 in McNemar (14). 
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TABLE II.—CORRELATIONS (AND Z Scores) BETWEEN Mip-PARENT 
EDUCATION AND CHILD’S MENTAL TEST SCORES AT EACH AGE 


TESTED 

Age Mid-Parent Education by N r z 
Mo. 1 | California First-Year Scale 52 —.01 —.01 

2 | Califognia First-Year Scale 58 — .02 7 
8 | California First-Year Scale 61 —.27* | —.28 
4 | California First-Year Scale 58 —.22 —.22 
5 | California First-Year Scale 58 —.29* | —.30 
6 | California First-Year Scale 57 —.23 —.23 
7 | California First-Year Scale 52 —.19 —.19 
8 | California First-Year Scale 53 —.05 —.05 
9 | California First-Year Scale 56 +.06 +.06 
10 | California First-Year Scale 56 +.08 +.08 
11 | California First-Year Scale 52 +.00 +.00 
12 | California First-Year Scale 53 +.03 + .03 
13 | California First-Year Scale 53 — .05 — .05 
14 | California First-Year Scale 46 —.02 —.02 
15 | California First-Year Scale 52 —.00 —.00 
18 | California Preschool Scale 51 +.16 +.16 
21 | California Preschool Scale 53 -29t .30 
24 | California Preschool Scale 48 -50t 55 
27 | California Preschool Scale 51 „41 44 
30 | California Preschool Scale 47 44 47 
36 | California Preschool Scale 49 -47. -51 
42 | California Preschool Scale 43 .35 -37 
48 | California Preschool Scale 45 -50 55 
54 | California Preschool Scale 44 -59 -68 
60 | California Preschool Scale 47 58 -66 
Ye 6 | 1916 Stanford-Binet 48 57 65 
7 | 1916 Stanford-Binet 46 -50 55 
8 | 1937 Stanford-Binet 47 59 -68 
9 | 1937 Stanford-Binet 45 57 65 
10 | 1937 Stanford-Binet 47 54 -60 
11 | 1937 Stanford-Binet 48 -61 71 
12 | 1937 Stanford-Binet 46 -59 -68 
13 | Terman-McNemar 41 -62 72 
14 | 1937 Stanford-Binet 40 -60 -69 
15 | Terman-McNemar 42 -56 63 

& 
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Taste II.—Continued 
Age Mid-Parent Education by N r À z 
16 | Wechsler-Bellevue 44 .64 .76 
17 | 1937 Stanford-Binet 43 .65 .78 
$s Wechsler-Bellevue 40 -60 -69 


*The negative r’s for months three and five are significant at the .05 
level of confidence or better. 

t The positive r at month twenty-one is at the .04 level of confidence. 
After this the r’s are all highly significant, usually above .001. 


is at first zero and even becomes negative during the second quarter 
year (at month five the r of —.29 is significant at the .03 level of 


confidence). The r’s hover around zero from seven months through 


fifteen after which they rise rapidly to .50 at two years. 
The positive r’s become statistically significant after eighteen 
months, the level of confidence being .01 or better from two years 
n 


J} When the sexes are treated separately, there appears to be little 


(sex difference in the parent-child relationships in mental ability, 
lin this group, although the girls’ scores are oftener than the boys’ 
‘significantly correlated with parental status. The r’s are given for 
ten ages in Table IIT, 

Bayley and Jones, in an early report on these changing relation- 
ships (6), discussed the question of the probable contributions of 
hereditary and environmental factors, The evidence seemed to 
indicate that hereditary factors emerged gradually as the children 
matured, but the proportional contribution of environmental fac- 
tors could not be determined, Conrad in 1931 (7) reviewed studies 
of kin resemblancés in physique and intelligence, He concluded 
that when various relevant factors were controlled, the kin re- 
semblances in physical and intellectual characteristics were about 
the same. He pointed out, however, that the mere similarity of 
correlation coefficients gives no proof of similar etiology of the 
resemblances. 

P hysical.—The question has arisen whether changes in parent- 
child similarities might occur during growth in other areas as well 
as in mental scores. Are there similar changes, for example, in 
physical characteristics which are relatively unaffected by environ- 
Fe differences, given a sample of normally healthy children 
(1) within a normal environmental range? To test this, we have 
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Correlation Between Midparent Education 


2 Volue and Children's Intelligence Scores 


Cite 4 6 8 ORNETA Me ene 
Age of Children 


Fig. 1, Correlations between Children’s Intelligence Sigma Scores and 
the Years of Schooling (Average of Mother and Father) of the Parents are 
Plotted by Age. Z-values are shown at the left, Correlation Coefficients at 
the right. 


correlated some available parent-child physical data, i.e., heights 
and weights, for the Berkeley Growth Study. 

The parents’ heights and weights were obtained by the visiting 
nurse in an interview which occurred soon after the children were 
born, at a time when most of the parents were young adults in 
their twenties. The measurements were not made under laboratory 
conditions, but may be considered fairly good approximations. 
The children were measured nude, during their regular visits to 
the Institute, according to exact anthropometric techniques (5). 

Correlations have been computed for three physical variables 
(height, weight, and an index of build, W/H2), for the parents as 
adults with these same measurements of the children made at 
nineteen ages (and in some cases thirty ages). These 7’s are given 
in Tables IV through VII. Because of physical sex differences, it is 
necessary to treat the sexes separately. 

The parent-child correlations in height are given in Table IV. 
The r’s with sons tend to increase with age, although unevenly. 
The father-son r’s are significant after three years, but the mother- 
son r’s are not significant until thirteen years. The girls are sig- 
nificantly correlated with their mothers after the first few monfis, 
but not with their fathers, except possibly toward maturity. © 
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Taste III.—CORRELATIONS BETWEEN CHILDREN’S MENTAL Test Scores 
AND THEIR PARENTS’ EDUCATION: SEXES SEPARATE 


Boys Girls 
CA N Father Mother N Father Mother 
g>” P r r da r r 

Mo. 6 31 —.29 .00 29 —.14 | —.05 
Yr. 1 28 04 24 28 —.08 .02 
3 26 .39* | .49t 26 26 .31 

6 25 -34 .47* 26 „56t -61t 

8 24 .34 .40* 26 64t -59t 

10 24 .44* | 38 #26 58t „51t 

14 21 .45* | .45* 21 60+ .52* 

15 21 42 42 21 52* 42 

16 22 .48* | .40 22 6lt -63t 

18 21 .43* | 41 19 „60t -63t 


* Significant at the .05 level of confidence or better. 
f Significant at the 0.1 level of confidence or better. 


Midparent values for height have been computed by making an 
adjustment in the height of the parent of the sex opposite to that 
of the child (9). This is done by multiplying the heights of the boys’ 
mothers by 1.08 and the heights of the girls’ fathers by 0.923 and 
averaging the adjusted heights of the two parents. (In this sample 
the boys’ mothers and fathers correlate with each other 31 in 
height and .04 in weight. For the girls’ parents the corresponding 


ı Ps are: height, .10, weight .11). The correlations of the children’s 


heights with the midparent values are shown in Table V and Figure 
2. The boys show a continuously increasing correlation with their 
parents; the r’s become significant at about four years of age. The 
girls show an early rapid increase, becoming highly significant by 
five months, after which the r’s level off, then seem to pick up again 
after twelve years. However, from the data in Table IV, it would 
appear that these relationships with the midparent values are due 
primarily to similarities with the parents of the same sex. 

The correlations for weight are shown in Table VI. The parent- 
son 7s are rather low, but there is a trend toward increasing re- 
lationship between the boys and their fathers, the r’s reaching the 
05 level of confidence by fifteen years. The father-daughter r’s 
ingicate some relation by five years, but they never reach statistical 


2] 
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TABLE IV.—CORRELATIONS (AND Z Scores) BETWEEN HEIGHTS oF 
PARENTS AND THEIR CHILDREN 


Boys Girls 
Age Fathers Mothers Fathers Mothers 
N J 
ele [ele | rfs|* |= 
Mo, 1 24 | .30 | .31) .05 05 | 26 | .36 | .38 | .30 31 
6 30 | .25 | .26 | .27 28 | 28 | .42* | .45 | .47¢ | .51 
E L 28 | .30 | .31 | .31 32 | 28 | .36 38 | .67ł | .81 
2 23 | .28 | .29 | .21 21 | 19 | .34 35 | .56* | .63 
3 25 | .45* | .49 | .25 26 | 24 | .38 40 | .67ł | .81 
4 22 | .52* | .58 | .25 26 | 23 | .35 | .37 | .66f | .79 
5 23 | .52* | .58 | .30 31 | 24 | .24 | .25 | .70ł | .87 
6 24 | .52t | .58 | .35 37 | 24 | .26 | .27 | .75t | .97 
7 24 | .53ł | .59 | .35 37 | 21 | .33 | .34 | .767.| .996 
8 23 | .54ł | .60 | .43 46 | 24 | .26 | .27 | .74t | .95 
9 23 | .56t | .63 | .43 46 | 22 | .30 | .31 | .71f | .89 
10 23 | .60ł | .69 | .37 39 | 24 | .24 | .25 | .72ł | .91 
il 24 | .57 | -65 | .41* | .44 | 21 | .29 | -30 | .68t | .83 
12 20 | .53* | .59 | .41 44 | 22 | .24 | .25 | .74t | .95 
13 24 | .45* | .49 | .50* | .55 | 17 | .56* | .63 | .62¢ | .73 
14 20 | .58f | .66 | .69t | .85 | 17 | .49* | .54 | .66t | .79 
15 21 | .42 | .45 | .71¢ | .89 | 20 | .43 | -46 | .68t | .83 
16 22 | .47* | .51 | .55¢ | .62 | 19 | .42 | .45 | .66¢ | .79 
17 20 | .64t | .76 | .60f | .69 | 20 | .44 | .47 | .66f | -79 
-18 21 | .52* | .58 | .44 | .47 | 19 | .64* | .76 | .52f | .58 
21 17 | .44 | .47 | .38 | .40 |20| .50* | .55 | .67ł | -81 


* Significant at the .05 level of confidence or better. 
+ Significant at the .01 level of confidence or better. 


significance. The mother-daughter correlations, however, show 
strongly increasing similarities in weight through sixteen years. 
They reach the .05 level of confidence with an r of .43 at two years 
and the .01 level by six years, when the r is .57, with 7’s considerably 
higher at most of the later ages. 

In our studies of growth we have found the W/H? index very 
useful as a measure of body build in children. It gives a measure of 


breadth, or volume, relative to stature. The parent-child correla- © 


tions for this index are given in Table VII. The relative independ- 


ence of the boys, especially in regard to weights, is reflecte@#in 
© 


10 


Taste V.—Corretations BETWEEN Mip-Parent Huicuts AnD HEIGHTS 
or THEIR CHILDREN 


The Journal of Educational Psychology 


Age N 
r 

gP Mo, P 24 -21 
2 32 22 
3 32 -24 
4 81 27 
5 31 23 
6 32 35* 
7 27 .37 
8 26 -36 
9 28 .36* 
10 29 .32 
11 28 -36 
12 29 -39* 
18 23 -22 
Yr. 2 27 -36 
3 26 .44* 
4 25 -52t 
5 23 .50* 
6 26 -56t 
7 24 At 
8 24 -58t 
9 2 -60t 
10 24 .60f 
i 25 60t 
12 21 -59t 
13 24 .62ł 
14 20 -82t 
15 21 „68t 
16 22 62 
17 20 1 
18 21 .58t 
21 17 .51* 


G 


* Significant at the .05 level of confidence or better. 
t Significant at the .01 level of confidence or better. 
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Correlations Between Midparent Height and Heights of Children 


2 Value 
12 


Šf. 
Br“ 2 Girls with Parent: e =, 
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8 10 12 
Age of Children 

Fig. 2. The Solid Line and Dots Show the Z-values (and Correlations) 
between the Heights of the Boys and a Midparent Score (Boys’ Mothers’ 
Heights are Multiplied by 1.08 and Averaged with the Fathers’ Heights). 
Broken Line and Open Dots Show these Values for the Girls and their 
Parents (Girls’ Fathers’ Heights are Multiplied by .923 before averaging 
with the Mothers’ Heights). 


these r’s: in the W/H? index the boys are unrelated to their parents 
at all ages. The girls’ indices however, show increasing r’s with 
both parents. The father-daughter r’s approach significance at 
years eight, nine and ten, and again at sixteen and seventeen years. 
Mother-daughter r’s are usually above the .01 level of confidence 
at most of the later ages. 

The sex differences in correlation are in some instances very 
striking. Even though the samples are small, statistical tests indi- 
cate that the differences may well be significant (Table VIII). 
In heights (Tables III and VIII) the boys are found to be more 
closely related to their fathers than to their mothers, at the .05 
level of confidence or better, at years four, five, six, ten, and eleven; 
while the girls are more closely related (.05 level) to their mothers 
at all ages except thirteen and eighteen years, the difference being 
highly significant for years seven to twelve (.01 or better). 

Tn general, also, the girls show more significant relation to their 
Parents than do the boys. Their height correlations with midpart 
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Taste VI—CORRELATIONS (AND Z Scores) BETWEEN WEIGHTS OF 
PARENTS AND THEIR CHILDREN 


Boys Girls 
Age Fathers Mothers Fathers Mothers 
N N N 
g DI z r $ r og r z 
Mo. 1| 25 |—.05 |—.05 | .35| .37 | 25 |—.20 |—.20 | 26 | .41* | .44 
6| 31| .13| .13| .23| .23 | 27 |—.16 |—.16 |28 | .23 .23 
Yr. 1/28] .19| .19| .20| .20|27] .12| .12] 28] .35 .37 
9) 25| .26| .27| .02| .02/21| .01) .01 | 22| .43*| .46 
3} 25| .33| .34| .07| .07 |23| .24| .25| 24] .47*| .51 
4| 21| .23| .23 |—.12 |—.12 |22| .19| .19 |23| .41*| .44 
5| 23 | .34| .35| .09| .09|23| .37| .30| 24] .47*| .51 
6| 24| .32| .33| .23| .23 |23| .30] .31] 24] .57ł| .65 
7| 24| .23 j| .23| .2| .20|20| .45*| .49 |21| .62ł] .73 
8| 23 | .29 | .30| .10| .10|22| .40| .42 |23| .67t| .8i 
9| 23| .28| .29| .06| .06 |20| .40| .42 |21 j| .68t] .88 
10} 23 | .31 | .32 |—.01 |—.01 | 23| .30| .31| 24] .74t | .95 
11 24| .23 | .23| .07| .07| 20) .32| .33| 21] .71t| .89 
12| 20 | .30| .31| .01| .01|21| .20| .20 |22| .73t| .93 
18} 24] 30| <31 | 17 | .17 |16] 19| .19]17 | .51t | .56 
14| 20 | .36 | .38 | .15| .15 |16] .22| .22 |17 | .89t | 1.42 
15] 21 | .46*| .50| .15| .15|19| .25| .26 |20| .88ł | 1.38 
16] 22 | .45*| .49 |—.09 |—.09 | 18 | .30| .31 |19 | .85t | 1.26 
17| 20 | .49*| .54| .28| .29/19] .42] .45 |20| .67t| .81 
18| 21 | .44| .47| .13| .13|18| .31| .32 | 19 | .43 46 
21| 17 |—.02 |—.02 | .10] .10} 19] .16| .16 |20| .81f | 1.13 


* Significant at the .05 level of confidence or better. 
t Significant at the .01 level of confidence or better. 


height (Tables IV and VIII) are significant at the .05 level or better 
for all but two of the fifteen r’s during the first three years: after 
that the boys’ r’s are equally high. In weight (Tables V and VIII) 
girls are more significantly correlated with their mothers than are 
boys with their fathers, at thirteen of twenty ages, eight of these 
-at the .01 level of confidence or better. 


CHILDREN’S SELF-CORRELATIONS 

eat wake may be relevant here to point to the low relationship between 
these children’s measurements in infancy and their own measure- 
mets as young adults. From Table IX, we see that mental test 


p ? 
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Taste VII.—CORRELATIONS (AND Z Scores) Berween W/H? INDEX oF 
PARENTS AND THEIR CHILDREN 


Boys Girls 
Age Fathers Mothers Fathers Mothers 
N N N 
CMa r z r z ¢ z: e 
Mo. 1| 24 |—.15 |—.15 | .32 33 | 25 |—.15 |—.15 | 26 | .13 .13 
6j 30 |—.09 |—.09 | .23*| .23 | 27 |—.31 |—.32 | 28 | .13 .13 
Yr. 1) 28 |—.12 |—.12 | .30*) .31 | 27 |—.10 |—.10 | 28 | .07 07 
2| 24 23 | .23 |—.11 |—.11 | 19 |—.04 |—.04 | 19 | .26 27 
8| 25 |—.04 |—.04 16 16 | 23 |—.01 |—.01 | 24 | .14 14 
4| 21 |—.27 |—.28 13 13 | 22 06 06 | 23 | .13 13 
5| 23 |—.07 |—.07 26 27 | 23 19| .19 | 24] .14 14 
6| 24 |—.24 |—.25 28 29 | 23 | .33 | .34 | 24 | .22 22 
7| 24 |—.14 |—.14 31 32 | 20 | .39 | .41 | 21) .39 41 
8| 23 |—.18 |—.18 14 14 | 22 46*| .50 | 23 | .44* 47 
9| 24 |—.12 |—.12 04 04 | 20 45*| .49 | 21 | .53* 59 
10} 23 |—.04 |—.04 |—.16 |—.16 | 23 42*| 45 | 24 | .64t 76 
11| 24 |—.03 |—.03 |—.03 |—.03 | 20 31 32 | 21 | .75t 97 
12} 20 |—.06 |—.06 |—.10 |—.10 | 20 37 39 | 21 | .65t 78 
13| 24 09 09 06 06 | 16 | .40 42 | 17 | .35 37 
14| 19 |—.08 |—.08 | .22 | .22|16] .30] .31 | 17] .8if | 1.13 
15) 20} .17 | .17} .16] .16/19] .31] .82 | 20] .78f | 1.05 
16) 22 |—.09 |—.09 | .08 | .08 |18| .44] .47] 19] .67¢]} .81 
17/20; .14] .14| .14] .14]19] .51%) .56 | 20] .41* |] .44 
18} 21 | .14 | .14| .03| .03/18| .47*| .51 | 19 | .17 AT 
21| 17 |—.54 |—.60 | .25| .26 |19| .43 |] .46/ 20] .69f | .85 


* Significant at the .05 level of confidence or better. 
t Significant at the .01 level of confidence or better. 


scores earned during the first year are unrelated to their eighteen- 
year test scores (3). As a matter of fact, the r’s of tests during the 
first nine months drop to zero or below when comparisons are made 
with scores at three years or older. However, the two-year and 
eighteen-years tests correlate .55, and starting at seven years all 
rs with eighteen-year tests are above .80. 

Growth in size is somewhat more stable, as may be seen in Table 
X. During the first year, the children’s heights correlate in the 
neighborhood of .5 to .6 with their heights as adults. By the time 
they are three years old the heights of both sexes correlate above i 
:83 with their own adult statures. 

The self-correlations in weights, on the other hand, preseņġ a e 
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Tasie VIII.—Stentricance or Sex DIFFERENCES IN PARENT-CHILD 


CoRRELATIONS 
In Height In Weight 
Boys Girls 
her with cr with | _ Girls correlate higher Girls correlate higher 
Gm Aeae | Maratas | wit Meiners | i te Bong, 
deni 
Mri ok wai 01 Mo, 1 -03 Mo. 1 01 
2 oye 04 2 04 Yr. 2 -05 
3 =a .02 3 04 3 = 
4 +03 02 4 02 4 05 
5 04 005 5 015 5 iy 
6 .05 .02 6 -02 6 .03 
7 fs +005 7 .03 7 .01 
8 rer 003 8 cP 8 -01 
9 os -007 9 01 9 .01 
10 01 -003 10 015 10 -005 
11 .05 01 11 ar 11 -004 
12 =A -003 12 .02 12 006 
14 ane -05 18 -O1 13 05 
15 ae 03 Yr. 2 .02 14 .001 
16 ne -03 3 02 15 -001 
17 a 04 16 -001 
17 .04 


* Not given if below the .05 level of confidence. 


picture more like that of the parent-child relationships for weight. 
Although the self-correlations are pretty consistently higher than 
the same-sex parent-child r’s, the trends of correlation, with age, 
tend to be parallel. 

Some correlations illustrating different degrees of consistency 
in build are shown for two indices in Table XI. The Stem 
Length/Height index is used to characterize the relative lengths 
of the legs and the body stem. The indices obtained on the Berkeley 
Growth Study cases at eighteen years are correlated with their 
indices at three earlier ages, three, twelve and thirty-six months. 
For both sexes the r’s are similar. They are lowest at one year, and 
in general show only a moderate relation to later status. The 
Weight/Height? index is correlated for the same age-intervals. This 
index of build shows no consistency in the boys between eighteen 

D 
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TABLE IX.—CORRELATIONS BETWEEN EARLIER MENTAL TEST SCORES AND 
WecasLer-BELLEVUE IQ ar 18 Years 


Age Test N r 
Mo. 1 California First-Year Scale 36 07 
6 California First-Year Scale 36 zl 
Wretey, Calif@rnia First-Year Scale 35 +25 
2 California Preschool Scale 36 55 
3 California Preschool Scale 36 49 
4 California Preschool Scale 34 66 
5 California Preschool Scale 36 76 
6 1916 Stanford-Binet 36 77 
7 1916 Stanford-Binet 36 -81 
8 1937 Stanford-Binet, L 36 85 
9 1937 Stanford-Binet, L 34 87 
10 1937 Stanford-Binet, M 36 86 
11 1937 Stanford-Binet, L 35 93 
12 1937 Stanford-Binet, M 34 89 
13 Terman-McNemar, C 81 93 
14 Stanford-Binet, L 33 89 
15 Terman-McNemar, D 33 88 
16 Wechsler-Bellevue 36 94 
17 Stanford-Binet, M 36 90 


years and the first three years of life. The girls, on the other hand, 
appear to be very stable from one year. 

_ The greater stability of the physical measurements and especially 
of heights, could be due to any or all of the following: (a) greater 
reliability of measurement, (b) greater stability during growth of 
the component parts in the values compared, and (c) smaller 
influence of environmental forces in producing individual differ- 
ences, at least for this sample. Weight is very reliably measured, 
but it does vary with growth changes in body proportions, and 
with health and nutrition. 


INDIVIDUAL TRENDS IN GROWTH 


The changing status of a child, during his growth, may be il- 
lustrated by plotting a curve of his relative (or sigma) scores with 
age. Six such curves are shown in Figures 3 and 4. They include 
both mental scores and height scores, and on each child’s chart 
the relative status of his own parents (compared with the parental 


e 
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TABLE X.—Curipren’s EARLIER HEIGHTS AND WEIGHTS CORRELATED 


WITH THEIR Own Marurs Hereuts anp WEIGHTS 


Height Weight 

Age Boys Girls Boys Girls 
od 2 N r N r N |r| N r 
Birth = — |-|- 23} 80) 23 .51 
Mo. 1 18 -64 | 20 | .87 | 19 | .19| 20 31 
3 23 aB 28 S18"). 23 „51 
6 22 e O E Sega) aL .40 
9 21 59 | 2 | .68 | 21 | 12] 23 46 
Yoi 21 T2 | 19 | .65 | 23 | .26| 22 „51 
2 21 68 | 19 | .68 | 21 | 42] 21 .62 
3 21 -83 | 19 | .85 | 23 | 44] 22 „59 
4 21 -85 | 19 | .82 | 20 | .32| 23 .63 
5 21 86 | 19 | .84 | 22 | 44] 23 -68 
6 21 -88 | 19 | 85 | 22 | 42) 2% 79 
7 21 87 | 19 | .85 | 23 | 52] 23 „81 
8 21 -87 | 19 | .84 arani O o 22 .87 
9 21 -89 | 19 | .88 | 23 | .52] 22 -86 
10 21 89 | 19 | .80 | 22 | 44] 24 .87 
11 21 88 | 19 | .64 | 23 | .60] 21 81 
12 21 88 | 19 | .77 | 20 | .58] 23 .78 
13 21 82 | 19 | .88 | 22 | .67| 19 .53 
14 21 -78 | 19 | .73 | 19 | .54| 20 -86 
15 21 74 | 19 | .98 | 20 | .53| 20 -92 
16 21 79 | 19 | .997 | 22 | .65| 22 -95 
17 21 92 | 19 | .999] 20 | .76| 23 .94 
18 21 8 | — | — 21 | 89) — = 


TABLE XI.—CORRELATIONS ror CONSISTENCY oF Two Bopy BurLD 
Inpicus: Excursen Years COMPARED WITH THREE 
EARLIER AGES 


Stem Length/Height Weight/Height? 
i 
rat Boys Girls noe Girls 
N r N r N r N 1 
8 | 21 47 | 17 43 21 10 1 
4 5 8 22 
12 | 21 | .08 | 19 -20 21 —.09 20 -81 
d 36 | 21 .45 19 -61 21 .08 19 -71 
> 
W) 
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STANDARD SCORES FOR HEIGHT 
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Case 14F 


Age O 2 4 6 8 10 i2 n l6 1 20 


Fios: 3 and 4. The Children’s Sigma Scores relative to the Means@or 
the Berkeley Growth Study for Successive Ages are shown by Dots and 
Smoothed Curve. For Comparison the Parents’ Sigma Scores relative to @ 
the Parents in this Group are indicated by a Straight Horizontal Line. 
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mean) is indicated by a broken horizontal line. As would be ex- 
pected from the correlations, most (though not all) children’s 
scores tend to settle at a level near that of their parents. But the 
individual curves bring out another fact, that some children reach 
that level much sooner than others. í 

The different patterns in physical growth can be shown to be 
due, in part, to differences in rates of maturing. Vor example, case 
18F matured very slowly: her skeletal age was retarded for all 
ages after two years, usually lagging behind at least a full year. 
When she finally reached maturity she had attained a tall stature 
similar to that of her parents. Case 8M was rather consistently 
short, like his parents: his skeletal age was average until thirteen 
years, after which it lagged somewhat, apparently postponing his 
reaching an eventual status slightly taller than his parents. Case 
3M, on the other hand, was accelerated at least a year at all ages 
after two years, and he tended to be relatively taller than his par- 
ents until he approached skeletal maturity. After sixteen years 
his scores fall below the parental line. 

We do not understand clearly the nature of the maturing proc- 
esses in mental functions. There could be inherent differences in 
rates, and also differential environmental factors operating to re- 
tard or enhance intellectual growth. But if we think of these chil- 
dren’s scores at twenty-one years as measures of their mature intel- 
lectual status, it is clear that they follow varied paths in attaining 
that status. Case 8M had an early short period of rapid develop- 
ment, and after two years maintained a position that is close to 
both his parents’ and his own mature level. Case 5F started out 
slowlyg but caught up to her own adult and her parental level by 
three and one-half years. Case 14F, however, after a very pre- 
cocious start settled more gradually to a position which never 
dropped quite as low as her parents’ educational status, but stabi- 
lized at about eight years, A 

Examination of all the individual curves (in addition to those 
shown here) reveals a great variety of trends, with examples of 
(relatively) rapid or slow growth occurring at all ages, both in 
heights and in intelligence scores. ‘ 


SUMMARY AND INTERPRETATION 


d Most of the physical comparisons show increasing parent-child 
sipilarities as the children grow older. However, there appear to be 
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some differences, both in the amounts of relationship and in the 
age-trends, Because the samples are small, it is possible that these 
differences are peculiar to the samples. If the samples are repre- 
sentative in these respects, it would appear that the within-sex re- 
lationships are stronger than those between parent and child of the 
opposite sex. Also, the mother-daughter relations seem tobe closer 


than the father-s® relationships. aR, 


The trends in mental relations differ in several ways from those 
. in physical relations. The children themselves are more stable in 
the physical measurements. But the trends of increasing similarity 
to their parents seem more consistent in the mental scores. Al- 
thought there are no sex differences in the parent-child r’s for the 
mental factors, there appear to be some marked sex differences 
in the physical factors, The analogy between the mental and physi- 
cal relations is, obviously, not a close one. Furthermore, a variety 
of questions could be asked about genetic relations in physical 
growth which cannot be resolved here. 

From these data it does, however, seem reasonable to assume 
the existence of avhereditary core of parent-child similarities in 
both mental and physical characters, even though such similarities 
may not be evidenced during the first year or so of the child’s 
life. The similarities tend to increase as the children approach the 
mature status at which the characteristics were observed in their 
parents. Inherent differences in rates of development during in- 
fancy could account, at least in part, for the early lack of parent- 
child similarity. 

Other factors operative in the reduced r’s at early ages are clearly 
observable in the phenomena of physical growth. These are the 
differences in the infants’ body proportions from those of adults, 
and the differential growth rates of different parts of the body. 
For example, an infant’s total height is made up in large part by 
his stem length, while the adult’s height is determined to a much 
greater extent by the length of his legs. In the Berkeley Growth 
Study, the mean stem-length/height ratio drops from .65 at three 
months to .51 between twelve and fourteen years (signifying rela- 
tively longer legs), and then rises slightly to about .53 in the young 
adult. The transverse dimensions of the body, as contrasted with 
the longitudinal dimensions, grow more rapidly during the first 
year after birth and again during early adolescence. 

Analogous age changes in organization could be expected in Pg 
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complex function during the process of its development (4). If we 
accept this position, then we may assume that differences in the 
mental organization of the infant and the adult could contribute 
to the changing parent-child relations in mental ability. Credence 
is lent to this hypothesis by the study of Skodak and Skeels (16) 
ho report increasing mother-child r’s which cannot be attributed 
environment. The effects of environmental influences should, 
consequently, be evaluated in the light of both age changes in 
organization (or relative maturity of the different parts or factors) 
and individual differences in over-all rates of maturing. 

Although the sample used here is too small to allow for anything 
but tentative interpretations, it seems very likely that the chang- 
ing parent-child relationships exhibited here are widespread. They 
deserve more extensive investigation in other populations and in 


other characteristics. 
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TWO METHODS OF TEACHING ENGLISH 
COMPOSITION TO RETARDED 
COLLEGE FRESHMEN’ 


RAY CHARLES MAIZE 


Air University, Maxwell AF Base, Alabama 


This study was established to provide a comparison between a 
‘language by experience’ procedure and a ‘grammar drill’ method 
of teaching composition to retarded students in English A sections 
at Purdue University. Its purposes were: (1) to determine which 
method teaches the student more effective habits of language ap- 
plication as measured by objective testing and sample writing; (2) 
which method requires more extensive outside work and prepara- 
tion on the part of the instructor; and (3) what relative factors, such 
as reading skill and mental levels, influence the student’s ability 
to achieve success. 

Sections of English A at Purdue University were used for the 
experiment. These slow and retarded students were separated from 
the other freshmen at Purdue by the scores they received on the 
American Council on Education Psychological Examination, the 
Purdue University Placement Test in English, and an original 
theme written during the entrance examination period. If their 
percentiles on the ACE Psychological Examination and the Purdue 
Placement Test in English averaged below 30 and their grade on 
the original theme (graded by the English faculty) was below 3 
(passing), then they were assigned to English A, a non-credit, one- 
semester course which they must take before going on to English 1. 

Recognizing the deficiencies of these students, the English faculty 
at Purdue had established English A as a remedial measure to bring 
the student to some degree of proficiency which would enable him 
to pass English 1, the required course for freshman credit in Eng- 
lish. Taught according to the program established by the Englisb 
department, English A consisted of workbook drill of grammar, 
punctuation, and spelling; and a weekly theme, corrected by the 
instructor. These procedures were supposed to be specifically de- 


1 This article is based on the writer’s doctoral dissertation, Purdue Uni- 
versity, 1952. Grateful acknowledgment is made to Professor Einar R- 
Raren under whose direction this work was done. 
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signed to change the language habit patterns of slow, retarded 
freshmen. 

To find out how well English A achieved the purpose for which 
it was established, permission was granted by the head of the Eng- 
lish department to conduct an experiment in which the officially 
sanctioned method of instruction would be contrasted against 
writing laboratory method. Accordingly, one group, the control, 
was organized and taught using the method established by the 
English department. The other group, the experimental, was taught 
using the writing laboratory method. 

Inherent in this latter method would be the need to free the 
instructor from extensive theme reading, for it was recognized at 
the beginning that merely doubling or tripling the number of stu- 
dent mansucripts would provide no answer for the one great burden 
of teachers of English: how to get student writing activity without 
increasing the already too heavy load of the teacher. The second 
objective, then, of the experiment would be the need to prove that 
the teacher does not have to edit all student writing, that more 
effective results can be obtained if the teacher deliberately refrains 
from overdoing such supervision while at the same time encourag- 
ing voluminous efforts by the students. 

The third objective of the experiment lay in the need to dis- 
cover the special problems of the retarded students. What factors, 
in other words, had greatest relevance to their success in the course? 
Was it reading skill, mental level, vocabulary, attitude, or feeling 
of failure or success? Just as important, therefore, as the degree of 
control over his language habits which the experiment should re- 
veal was the necessity to learn more about the individual who was 
to be taught. 


STRUCTURE OF THE EXPERIMENT 


From the freshmen students assigned to English A sections at 
Purdue durmg the year 1949-1950, one hundred and forty-nine 
were selected at random to become a part of this experiment to de- 
termine relative merits of two different approaches to the teaching 
of language skill on a college freshman level. 

The method of selection for membership in any one of the groups 


R 


has already been described. On the basis of these factors, students 


were registered by chance in the sections used for the experiment: 
seventy-four to the control group and seventy-five to the experi- 
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. mental group. The control group was taught by the grammar 

workbook drill method; the experimental group was taught by the 

writing laboratory method. 

During the semester of instruction, the following tests were ad- 

ministered: the Terman-McNemar Test of Mental Ability; the 

i g Lenmon:Nelson Test of Mental Ability; the Iowa Silent Reading 

Test; the Rinsland-Beck Natural Test of English Usage, Tests I, 

II, and IT; the Seashore Recognition Vocabulary Test; essays on 

an identical subject written before and after the period of instruc- 

tion; the Horrall College Experience Scale; the Silance-Remmers 

T Scale to Measure Attitude Toward Subject; and the Silance-Rem- 
J mers Scale to Measure Attitude Toward Teacher. 

To determine the amount of change and the significance of 
change before and after instruction, six units of measurement were 
established. These units were: (1) identical forms of the Seashore 
Recognition Vocabulary Test to determine change in total vocabu- 
lary; (2) equivalent forms A and B, Test I of the Rinsland-Beck 
Natural Test of English Usage, to measure change in ability to 
capitalize and punctuate; (3) equivalent forms A and B, Test II 
of the Rinsland-Beck to determine change in ability to work with 
diction, tense, pronoun reference, and verb forms; (4) equiválent 
forms A and B, Test III of the Rinsland-Beck to measure change 
in ability to improve sentence structure, organize logically, and 
write effectively; (5) combined scores for Test I, II, and III of the 
Rinsland-Beck to obtain composite language results; and (6) aver- 
age of numerical theme grades awarded by two different English 
instructors after reading student manuscripts on an identical sub- 
ject assigned before and after instruction. 

Since one of the stated purposes of this study was to determine 
what relative factors have influence on the student's ability to 
achieve Success, two different intelligence tests were used, in addi- 
tion to the ACE Psychological Examination. The Iowa Silent 
Reading Test was also used to discover the relationship of scores 
in reading to intelligence scores and to other evaluation forms. 

Tt was also considered important to find out what effect, if any, 
the personality and enthusiasm the various instructors might 
have on the results. In addition, the attitude of depression or 

» buoyancy, caused by success or failure, might also have an effect 
on the findings. For these reasons the Silance-Remmers Scale to 
Mrnsure Attitude Toward Subject, the Scale to Measure Attitude 
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Toward Teacher, and the Horrall College Experience Scale were 
used. 

During the period of instruction the control group wrote fourteen 
themes of approximately two hundred and fifty words in length. 
These themes were read and corrected by the instructor outside the 
classroom. The experimental group wrote forty themes of @pproxi aay, 
mately two hundred and fifty words in length. The instructor did 
not read or correct any of this writing outside the regularly sched- 
uled class hours; instead analysis and comment were provided in 
the writing laboratory by fellow students and the instructor. 


e 


ANALYSIS AND INTERPRETATION OF DATA AS 


In this research the control and experimental groups were con- 
sidered equal in ability at the beginning of the study. Data for indi- 
cations of equality were obtained from the mean of the raw scores 
received by the students on the ACE Psychological Examination, 
the Purdue Placement Test in English, and the numerical theme 
grade. The t-ratios which were obtained for the difference between 
these means were not statistically significant according to t-tables. 


MEASURE OF CHANGE 


Table 1 indicates the results obtained from the six units of meas- 
urement which were established to determine the amount of 
change and the significance of change for the control and experi- 
mental groups. 
As shown in Table 1, changes on the vocabulary test were fav- 
orable and significant at the one per cent level of confidence for 
both groups. A t-ratio for the difference between the changes, how- 
ever, was not statistically significant. The difference between 
changes of the two groups could, therefore, be due to chance. 
On Test I of the Rinsland-Beck the change was significant at 
the one per cent level of confidence for both groups. A t-ratio for 
the difference between changes, however, was significant at the 
five per cent level of confidence. The change for the experimental 
group, therefore, was significantly larger. , 
On Test II of the Rinsland-Beck the change for the control group 
was significant at the 5 per cent level of confidence, for the experi- E 
mental group at the one per cent level of confidence. A t-ratio for P 
the difference between changes was acceptable at the five per cag 
e 
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Taste I. Means and t-Ratios for Control and Experimental Groups on 
Measurements to Determine Change before and after Instruction 


Mean of Mean of R 
Control Experimental gratio, 


Before | After | Before | After Contr Exp 

J sp 
Seashore Recognition Vo- | 61.1 | 58.6 | 62.5] 57.8 | —3.6ł| —5.9t 
cabulary Test 


Test I, Rinsland-Beck | 55.4 | 59.0 | 55.0] 62.8 5.14} 9.8ł 
Natural Test of English 
Usage 


Test II, Rinsland-Beck | 53.1 | 55.2 | 49.3] 54.9 2.3t) 5.6t 
Natural Test of English 
Usage 


Test III, Rinsland-Beck | 29.5 | 28.3 | 24.9] 383.2] —1.2 5.9t 
Natural Test of English 
Usage 


Combined Scores, Rinsland-| 138.1 | 142.4 | 129.2 | 150.9 3.1 | 12.8} 
Beck Natural Test of 
English Usage 


Numerical Grades Given | 9.7| 9.1] 10.1| 11.9 | —1.2| 6.0 
By Theme Readers 


t1% level of confidence. 
t 5% level of confidence. 


level of confidence. The experimental group, therefore, achieved * 
greater change than the control group. 

On Test III of the Rinsland-Beck the control group showed no 
change that was statistically significant. The experimental group, 
on the other hand, showed change which was significant at the one 
per cent level of confidence. The t-ratio for the difference of change 
was also significant at the one per cent level. On this form, directly 
testing the student’s ability to write, the evidence was all in favor 
of the experimental group. 

On the Combined Scores of the Rinsland-Beck the total change 
for both groups was statistically significant at the one per cent 
level of confidence. The t-ratio for the difference of change, how- 


ayer, was significant at the one per cent level of confidence in favor 
D of the experimental group. 
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On Numerical Grades Given by Theme Readers, the control 
group showed no significant change. The experimental group, on 
the other hand, did show positive gain in ability to write, signifi- 
cant at the one per cent level. The t-ratio for the difference of 
changes was also significant at the one per cent level. 


e 
® CORRELATION OF FACTORS -= 


One of the stated purposes of this study was to determine the 
relationship to one another of the many factors involved, in an 
effort to understand more clearly the nature of the teaching task 
and to obtain the necessary insight into those factors which seemed 
to contribute most to success or failure on the part of the student. 

The following details are worthy of note: 

a) The mean chronological age of the control group was 19.5 
years; the mean age of the experimental group was 19.6 years. 

b) The mean raw scores for both control and experimental 
groups on the Henmon-Nelson Test of Mental Ability were re- 
ported by the authors of the test at the 40th percentile, indicating 
that the students who form the population for this study were at 
least in the lower half of any population reported on by stand- 
ardized tests. 

c) When the mean raw scores on the Terman-McNemar Test of 
Mental Ability are compared with tables provided by the authors, 
the results show a mean IQ for both groups of 99 and a percentile 
ranking of 48. 

d) The highest degree of correlation found between the Iowa 
Silent Reading rate and any other evaluation factor was that of 
.51 with the Q score on the ACE Psychological Examination. Other- 
wise the rate of reading seemed to bear little relationship with any- 
thing else. 

e) The mean of the standard comprehension score as reported 
for the control and experimental groups on the Iowa Silent Reading 
Tests show a percentile ranking of 16 and a grade equivalent of 
10.2, indicating a two to three year retardation in reading ability 
for students who are supposedly working on a 13th grade level. 


EFFECT OF ATTITUDES 


It was considered important to the results of this study to find 
out what effect, if any, the personality and enthusiasm of the vari- 
ous instructors might have. In addition, the attitudes of depress, 
or buoyancy, caused by success or failure in the course, might also © 


e 
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have an effect upon the findings. It therefore became necessary to 
discover whether such moods differed between the control and ex- 
perimental groups. On the Silance-Remmers Scale to Measure 
Attitude Toward Subject and Attitude Toward Teacher no differ- 
ence could be found between attitude reactions of students in 
gontrol,gnd experimental groups. Similarly, no difference could be 
ee found on the Horrall College Experience Scale to measure depres- 
sion or buoyancy. The conclusion was drawn that the lack of signifi- 
cant difference in measurement of attitude or mood indicated a 
lack of influence on the evaluation techniques used in this study. 

: SUMMARY AND CONCLUSIONS x 

As a result of the methods of instruction and the testing con- 
ducted before, during, and after the period of instruction, these 
conclusions can be drawn: 

a) The control group showed statistically significant change 
measured before and after instruction on vocabulary, mechanics, 
and grammar. The experimental group, on the other hand, showed 
statistically significant change as measured before and after in- 
struction on vocabulary, mechanics, grammar, controlled composi- 
tion, and free composition. 

b) Furthermore, the experimental group showed a statistically 
significant larger change on all measurements except vocabulary. 
All other measurements used in this study, therefore, showed 
clear evidence of superiority on the part of the experimental groups 
as the result of the methods of instruction. 

c) All data revealed that students were of limited mental capac- 
ity and possessed an initial low level of ability in language use, * 
vocabulary, and reading. In addition, changes for both low-level 
groups were slight, even after a semester of instruction. i 

d) No evidence could be found of the effect of student attitude 
or mood on the results of this study. 


DIFFERENTIAL RESPONSES AMONG COLLEGE 
STUDENTS ON THE VOCABULARY SUBTEST 
OF THE WECHSLER INTELLIGENCE SCALE 


CHALMERS L. STACEY and 8. WILLIAM SPANIER 


Syracuse University 


Gerstein (1) in an article in the Journal of Consulting Psychology 
in 1949 suggests a relationship between the use of language and the 
underlying cognitive process involved. She proposes placing more 
emphasis on this relationship instead of merely determining 


whether or not the subject has given the correct definition of a ' 


word on a vocabulary test. That is, one should, in addition to de- 
termining whether the definition is correct or incorrect, attempt 
to differentiate the type of definition which is considered cor- 
rect. 

Reichard, Schneider and Rappaport (2) using the Goldstein- 
Weigl (3) color-sorting test as an instrument for investigating con- 
cept formation in children arrived at the conclusion that there 
exist three basic methods for forming concepts: (a) the concretistic, 
(b) the functional and (c) the conceptual. 

Gerstein on the basis of the Reichard, Schneider and Rappaport 
study feels that the same three methods of concept formation 
which were exhibited in that study to solve a performance prob- 
lem may also be used in classifying the definitional levels of 
responses to verbal stimuli. She suggests the following three classifi- 
cations which parallel the aforementioned three: (a) the concretis- 
tic or descriptive method which is considered to be at the lowest 
level of verbalized definitions (the most primitive level), e.g., 
“an apple is red” or “a diamond is very hard and glitters, shines 
and sparkles”; (b) the functional or usage method, a more complex 
level and considered to be at the next higher level of verbalized 
definitions, e.g., “a coat is something you wear to keep warm” 
or “a microscope is used to make observations of minute objects”; 
and (c) the categorical or conceptual or abstract method, the 
most complex method and considered to be at the highest level of 
verbalized definitions, e.g., “wood is a fuel” or “a bicycle is a means 
of transportation”. 
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STUDIES MADE ATTACKING THIS PROBLEM 


In two separate studies, Stacey and Portnoy (4, 5) attempted 
to test the hypothesis put forth by Gerstein. Both studies used 
groups of individuals that were subnormal in intelligence as de- 
termined by the Wechsler Intelligence Scale. In their first study, 


@ermvhen ccmparing the results of two groups of children, one classified 


as borderline and the other as mentally defective, they arrived at 
the following results, having used the Vocabulary Subtest of the 
Wechsler Intelligence Scale for Children for the verbal stimuli: 
(1) the borderline group (higher on the scale of intelligence) gave 
a greater proportion of descriptive type definitions than did the 
mental defective group—in fact, the two groups were significantly 
different at the one per cent level, (2) the mental defective group 
gave a greater proportion of the functional type of definition—this 
difference was also significant at the one per cent level, and (3) 
there was no significant difference between the two groups where 
the conceptual type of definition was concerned though the border- 
line group did show a slightly higher proportion of this type of 
definition. 

Tn their second study, using three groups of adults, one classified 
as dull-normal, one as borderline and the last as moron, they 
found no significant differences in terms of the critical ratios be- 
tween any two groups of the three in any of the three categories 
(descriptive, functional or conceptual). Nevertheless, the same 
general trend was evident. As in the first study, the higher level of 
intelligence gave a greater proportion of descriptive type defini- 


tions than did the next lower level of intelligence, and a lower | 


proportion of functional type definitions than did the next lower 
level of intelligence, and a slightly higher proportion of conceptual 
type of definitions than did the next lower level of intelligence. 
Stacey and Portnoy, in this same study, then compared the two 
groups which comprised the two studies but omitting the dull- 
normal group of the second study, i.e., they compared the adult 
moron (of the second study) with the child moron (of the first 
study) and the adult borderline (of the second study) with the 
child borderline (of the first study) thereby holding intelligence 
constant and age as a variable. The following results were obtained: 
(a) the older age groups gave a greater proportion of descriptive 
and abstract types of definition, the difference being significant 
*@Hthe one per cent level, (b) the younger age groups gave a greater 
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proportion of functional definitions. Again the difference was 
significant at the one per cent level. 

The results of these two studies appear directly to contradict 
Gerstein’s hypothesis of the levels of methods of concept formation 
with the exception of the abstract or conceptual method. Gerstein’s 
hypothesis is thus contradicted in terms of levels of intelligence 
and the Reichard, Schneider and Rappaport hypothesis (which e 
includes the statement that children think at the concrete or 
descriptive level up to about age eight, and somewhere between 
the ages of eight and nine the functional level is reached, and at 
about the age of eleven the conceptual or abstract level is reached) e 
is thus contradicted in terms of age. 

Tn another study, Stacey and Markin (6) used the Similarities & 
Subtest of the Wechsler Bellevue Intelligence Scale Form I with 
three groups of adults classified as (1) mental defective, (2) border- 
line and (3) dull-normal. Again as in the two previously mentioned 
studies by Stacey and Portnoy, the same trend may be noted. 
The greater proportions of descriptive responses were made by 
the higher levels of intelligence—the difference between the mental 
defectives and the dull-normal groups being significant at the one 
per cent level; the difference between the other groups, though not 
significant, showed the same trend. Also, the greater proportion of 
functional responses were made by the lower levels of intelligence 
—the difference between the mental defectives and dull-normal 
being significant at the one per cent level; the difference between 
the mental defectives and the borderline groups being significant 
at the five per cent level and the difference between the borderline 
and dull-normal groups, though not significant, again showed the 
same trend, The differences between the three groups at the 
abstract level were much too small to draw any conclusions. 

Since all the results thus far have been gained through the use 
of individuals of subnormal intelligence as subjects, it is the pur- 
pose of this paper to further test Gerstein’s hypothesis, but this 
time using as subjects one hundred eighty-eight college juniors 
and seniors. 


METHOD 


The vocabulary section of the Wechsler Bellevue Intelligence =, 
was administered to one hundred eighty-eight college stu- 
dents. The responses given by the students to each of the forty 
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five words in this section were recorded verbatim. The response 
sheet of each student was graded in accordance with the instruc- 
tions cited in the Wechsler Bellevue Manual. Two separate groups 
of students were established by selection of the highest twenty-five 
per cent of the grades and the lowest twenty-five per cent of the 
grades (forty-seven students in each group). Each correct definition 
given by each student for each of the nouns (daly nouns were used 
for analysis) listed on the response sheet was then broken down 
into one of the three categories: descriptive, functional or concep- 
tual. 

Each breakdown was then recorded in its proper column on a 


chart, and the total number of correct responses for each student “ 


also recorded on the chart, A grand total of correct responses was 
then calculated and recorded on the same chart. Separate totals 
were also calculated for each of the three categories and the pro- 
portions of each of the three Separate categories to the grand total 
number of correct responses computed. This procedure was followed 
separately for the two groups—the upper twenty-five per cent 
and the lower twenty-five per cent. It might also be said that a 
third group was established from the middle twenty-five per cent 
of the grades and the same procedure followed with this group. 
Of course, any significant difference that might exist would come 
to light in the critical ratios computed between the high and low 
groups. The middle group should only serve to bridge the gap 
between these other two groups, and it is for this reason only that 


the third group was established. Critical ratios were then computed 


between the proportion of responses given by the three groups of 
students in each category, i.e., a critical ratio was computed for 
the proportion of descriptive responses in the high group as com- 
pared to the proportion of descriptive responses in the low group, 
and also the proportion of descriptive responses in the middle 


— 


group was compared to the proportion of descriptive responses of 


both the high and low groups separately. The results of these 
comparisons are presented in Table I. 


RESULTS 


There was no significant difference between the proportions of 


prcrivtive Tesponses given by the top and middle groups, but 
D 
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TABLE I.—FREQUENCY AND PROPORTIONS OF LEVELS oF RESPONSE TO THE 
VOCABULARY SECTION OF THE WEcHSLER-BELLEVUE INTELLIGENCE 
Scare 


Top 25% Middle 25% ‘Bottom25%| Top 25% 
CR CR CR 


Category 
Freq.|Prop. Freq.| Prop. Freq.|Prop.| Freq.| Prop. 


Abstract 1043] .74|4.05*| 880| .67 |2.114| 741| .63/6.11%71043 TPE o 
Functional 23| .02/6.66*| 90| .07 78| .07|6.49*| 23| .02 
Descriptive 339| .24|1.21 | 347| .26 |2.22f] 347| .30/3.43%) 339) .24 


Total 1405 1317 1166 1405) 


* Significant at the one percent level. 
t Significant at the five percent level. ES 


the trend was still evident—an increase in the proportion of de- 
scriptive responses given by the higher group to the next lower 
group. The proportion of descriptive responses between the middle 
and bottom groups differed significantly at the five per cent level. 

2) The proportion of functional responses made by the top and 
bottom groups differed significantly at the one per cent level. The 
proportion of functional responses between the top and middle 
groups differed significantly at the one per cent level, but there 
was no difference at all in the proportions of functional responses 
between the middle and bottom groups. 

8) The proportion of conceptual responses made by the top and 
bottom groups differed significantly at the one per cent level. The 
proportion of conceptual responses between the top and middle 
groups differed significantly at the one per cent level and the pro- 
portion of conceptual responses between the middle and bottom 
groups differed significantly at the five per cent level. 

4) Each of the groups gave the greatest proportion of responses 
inthe conceptual category, the next highest proportion of responses 
in the descriptive category and the lowest proportion in the func- 
tional category. 


SUMMARY 

The responses of the three groups of college students (groups 

arbitrarily established on the basis of scores on the Vocabulary 

Subtest of the Wechsler Bellevue Intelligence Scale) were analyzed 

as to the proportions of their vocabulary responses that fell into 
the descriptive, functional or conceptual categories. e 
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According to the Gerstein hypothesis the descriptive type of 
definition is the most primitive, the functional type of definition 
is the next highest level, and the conceptual type of definition is 
the most complex. We should therefore expect that in any study 
of this kind we would find that the groups of higher intelligence 


ee would give (1) a greater proportion of conceptual type defini- 
» 


ion than would the groups of lower levels of “Intelligence, (2) a 
somewhat higher proportion of functional type definition than 
would the groups of lower levels of intelligence and (3) a lower 
proportion of the descriptive type of definition than would the 
groups of lower levels of intelligence. But in the three studies 
mentioned earlier in this paper, the results substantiated Ger- 
stein’s hypothesis in only one of the three categories—the concep- 
tual. The other two categories—the functional and the descriptive 
—ran directly counter to the hypothesis set forth by Gerstein. 

The results of this study are in agreement with the results of 
the previous studies on only two of the three categories—the 
conceptual and the functional. The discrepancy lies in the descrip- 
tive category. We therefore find that the results of this study 
substantiate Gerstein’s hypothesis on two of the three categories— 
the conceptual and the descriptive. That is to say that: (1) the 
top group gave a higher proportion of conceptual responses than 
did the middle group and the middle group a higher proportion 
than did the bottom group, (2) the top group gave a lower pro- 
portion of descriptive responses than did the middle group and 
the middle group a lower proportion than did the bottom group; 
in contrast to Gerstein’s hypothesis the top group gave a lower 
proportion of functional responses than did the middle group and 
there was no difference at all in the proportion of functional re- 
sponses between the middle and the bottom group. 

The discrepancy which exists between these results and those of 
the three previous studies—namely, the discrepancy in the descrip- 
tive category—may, of course, be due to the fact that in the 
previous studies subjects of subnormal intelligence were used 
whereas the sample in this study was composed of college students. 
Certainly a great deal more research is necessary before any 
absolute conclusions may be drawn or even assumed. However, 
the results of all four studies do leave in doubt that part of Ger- 
stein’s hypothesis which states that the functional type of defini- 
tig is of a higher level than is the descriptive. But here again 
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further research is necessary before these results may be considered 
conclusive. 
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ITEM VALIDITY AND RESPONCE CHANGE 
UNDER TWO DIFFERENT TESTING 
CONDITIONS! 


DOUGLAS G. SCHULTZ 


D å 
es Educational Testing Service ” 


Princeton, N. J. 


Tt has often been demonstrated that various responses to the 
usual self-description inventories and questionnaires can be elicited 
by altering the motivational pressures under which the instrument 
is administered. Bennett and Gordon (1), Ruch (4), and Schott 
(6) have all reported studies in which persons who took this type 
of test as part of the selection process for school or job gave con- 
siderably more favorable responses than persons who took the 
test under non-selection circumstances. Using the same question- 
naire with which this report will be concerned, Myers (3) found a 
large number of statistically significant differences between the 
mean item responses of students accepted for college and those of 
college applicants. He also found that the applicants presented 
themselves generally as being more sober and industrious and 
better adjusted than the accepted students. Results obtained by 
Hollingworth (2), Simpson (8), and Van Wagenen (9) also support 
the point. When the answers given to inventories and question- 
naires may have a decisive bearing on the attainment of some 
desired goal, it is apparent that many people either intentionally 
or unintentionally tend to respond in a manner which they think 
or feel will present the best opportunity for achieving that goal. 

The study reported in this paper examined such shifts in re- 
sponse from the standpoint of the individual item. (Throughout 
the paper, responses to items and validities of items will be dis- 
cussed.) The purpose of the research was to determine how changes 
in the item responses to an attitude-interest questionnaire admin- 
istered under selection and non-selection testing conditions are 
related to the validities of the questionnaire items for an academic 
criterion. Specifically the two problems investigated could be 


1 This paper was read at the American Psychological Association meeting 
in Bishington, D. C., on September 2, 1952. 
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phrased as follows: 

1) Does the average response to an item change more (from 
non-selection to selection administrations) if the item is of high 
validity than if it is of low validity? 

2) Do the items to which the average response is changed most 


also decrease mogt in validity? . < 


HYPOTHESES, 


With regard to the first of these problems, it was anticipated 
that the correlation between mean item response change and 
external item validity would be zero, on the grounds that students 
seeking admission to college would exhibit no consistent insight 
into the way the items operate. It was felt that, in their general 
desire to give the most favorable impression, they would not 
differentiate between the more and less valid items and that there- 
fore the mean change would be similar for items of all levels of 
validity. 

Turning to the second issue, widespread response shifts from 
non-selection to selection groups were expected on all items and 
it was thought that these shifts might raise or lower item validities 
by chance. But it was hypothesized that there would be some 
tendency for the larger changes in item response in either direction 
to be associated with the greater reductions in item validities. 
Generally, then, change in mean response would be related to 
change in item validity only to a small extent. 


SUBJECTS 


Members of two successive freshman classes at a women’s liberal 
arts college of an eastern state university were the subjects for the 
study. There were three hundred fifty-five women in the class of 
1951 and three hundred forty-five in the class of 1952. 


PROCEDURE 


For several years the Educational Testing Service has been 
attempting to develop a questionnaire which would measure non- 
intellectual factors associated with academic achievement in col- 
lege? The study, supported by the College Entrance Examination 


* The development and validation of this questionnaire over & three-year 
Period are reported elsewhere in two papers (4, 7). e 


¢ 
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Board, has been directed primarily at evaluating characteristics 

of women entering college. A questionnaire of three hundred twenty- 

eight items was constructed and administered to the three hundred 

fifty-five members of the freshman class, the Class of 1951, as a 

group three weeks after their arrival at college. One hundred and 

ty-five,of the same items were arbitrarily selected for reproduc- 

if tion in a booklet which was administered individually to applicants 

for the next year’s freshman class, the Class of 1952, as a part of 

the normal application for admission; three hundred forty-five 

women were admitted to this class. Thus, members of the first 

m class responded after they had been safely admitted to the college, 

whereas respondents in the second group were seeking admission 

4 to college. These are the postadmission and preadmission groups, 
respectively. 

The one hundred and forty-five items in the questionnaire were 
of the usual inventory type. For example, one item asked: “Do you 
feel that the higher the grades a girl gets in college the more she 
will amount to after college?” The five possible responses were: 
(1) Yes, (2) Probably Yes, (3) Uncertain, (4) Probably No, and 
(5) No. In other items the respondent was to indicate how appeal- 
ing certain topics of conversation might be to her. In some items 
she was to give the degree of importance to her of various reasons 
for attending college. Over all, the content covered the student’s 
intellectual interests, motivation for attending college, and teacher 
relations and study habits. 

Each of the items had four, five, or seven possible responses, 


numbered from 1 to 4, 1 to 5, or 1 to 7. Since this numerical order . 


corresponded to a logical order of responses, a linear response 
continuum was assumed and the score of a person on an item was 
taken to be the number of the response which was checked. Thus, 
a respondent checking ‘Probably No’ in reply to the question cited 
above would receive a score of 4 on the item. 

Using these scores, the average response for each of the one 
hundred and forty-five items in each of the two classes was com- 
puted. The Class of 1951 (postadmission) item mean subtracted 
from the Class of 1952 (preadmission) item mean was taken as an 
index of the change induced in the response to that item by the 

, different conditions of administration. The difference for any item 
could be either positive or negative, depending only on whether 
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the average of the response numbers checked by the Class of 1952 

was higher or lower on the arbitrary response scale than the average 

of the numbers checked by the Class of 1951. It should be noted 

that the same individuals are not being compared under two testing 
conditions here nor are two groups of matched pairs of students. 
Assuming that the two classes were selected in the sanfe manmay , 
from the same general population, however, it is reasonable to 
compare the mean score of an item for one class with the mean 
score for the other class for the purposes of the study. 

The criterion used for validating the items was the so-called 

‘achievement index’, which was derived for each individual by 
taking the difference between her actual Freshman-Year College YB 
Grade-Point Average and her most probable Grade-Point Average 
predicted from the two scores of the College Board Scholastic 
Aptitude Test. (The multiple correlations of the Scholastic Apti- 
tude Test verbal and mathematical sections with Freshman-Year 
Grade-Point Average were .51 in the Class of 1951 and .52 in the 
Class of 1952.) The product-moment correlation of each of the 
one hundred and forty-five items with the criterion was obtained. 
It was possible to calculate two such validity indices for each 
item, one in each class. It should be noted that since the items 
were of the common inventory type there was no ‘right answer’ in 
the sense of factual items and so there was no expected sign for 
any validity coefficient; either negative or positive values might 
be equally satisfactory and both were found in the present case. 

Using the mean response differences and item validities de- 
scribed, two kinds of analyses were then carried out: 

(a) Each item validity index in the Class of 1951 (non-selection) 

Was plotted against the difference between the two class mean re- 
sponses for that item; a similar plot was made with the Class of 
1952 (selection) validity indices. 

(b) The change in item validity from one class to the other was 
Plotted against the mean item response difference. In considering 
the change in item Validity a rise in the direction of the non-selec- 
tion group’s value (Class of 1951) was taken as a positive change 
ot ‘tise’ in validity, a drop toward or beyond zero in the other 
direction as a negative change or ‘drop’. 

Since items of four, five, and seven choices were included, plots © 
were also made substituting the critical ratio of the bier ts d 
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the mean item response difference. The item validity r’s were not 
transformed into z’s because all of them were low and so in the 
range where the transformation has little effect. 


RESULTS 


» For theresults, the two kinds of analyses will bo considered sepa- 


a=) 


rately. First, on the problem of item validity and item response 
change, the correlations computed from the scatterplots were as 
follows: 
In the Class of 1951 (Postadmission) 
Item validity vs. mean item response change: +.13 
Item validity vs. CR of mean item response change: +.13 
In the Class of 1952 (Preadmission) 
Item validity vs. mean item response change: +.21 
Item validity vs. CR of mean item response change: +.24 
The positive sign means that the largest item response changes 
in the direction indicated as desirable by the sign of the item valid- 
ity coefficient are associated with the items of highest validity. 
Assuming that all the conditions underlying use of the common 
formula for the standard error of a zero correlation are fulfilled 
here, the N being one hundred and forty-five items, the two values 
of +.13 do not differ significantly from zero in the statistical sense, 
the +-.21 almost reaches the .01 level of significance, and the +-.24 
does reach the .01 level. Although all four coefficients cited are 
positive, the pertinence of this agreement from the standpoint 
of statistical significance is somewhat reduced by the fact that the 
coefficients are by no means independent of one another. 
As a matter of interest, the correlations using the critical ratio 
in an attempt to adjust for the variations in number of possible 


*The means and standard deviations for the four variables involved 
were: 


D 
Item validities (Class of 1951)......................... ve “06 
Item validities (Class of 1952)............. A -00 „06 
Differences between mean item responses......... ...| +0.05 | 0.35 
CR’s of differences between mean item responses....... +0.57 | 3.99 


A note should perhaps be added to clarify these item validities statistics- 

The item validities were distributed approximately symmetrically around 

zero, both positive and negative values being found as mentioned above, 

so the means of the item validities were close to zero as reported here. The 

le items, of course, would be those with the highest validity co- 
nts regardless of sign. 
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responses do not differ essentially from those using the straight- 
forward mean response difference. Also, the differences between 
comparable coefficients from the two classes are neither large nor 
statistically significant. 

When the scatterplots for the four correlations were examined, 


the picture seemed%o be a very common one with no suggestion™™@q a 


of a curvilinear trend of any kind. 

Turning to the second problem, the correlation between drop in 
item validity from the post- to preadmission class and change in 
mean item response was +.01. Substituting the critical ratio for 
the mean response difference resulted in a correlation of —.02, 
Thus, no association was found between decrease in item validity 
from one class to the other and change in mean item response. Here, 
too, examination of the scatterplots revealed only the common 
ellipse; a nonlinear relationship was not uncovered. 

An interesting supplementary point arose out of the data in- 
volved in the two correlation coefficients just mentioned. The 
mean of the changes in item validity was —.03 (the standard 
deviation of the changes being .06). The negative sign attached to 
the mean reflects a general, slight drop or negative change (i.e., 
toward or beyond zero) in the item validities from the postadmis- 
sion class to the preadmission class. How much of this drop was 

due to the unreliability of the item validity statistic and how much 
to the difference in situations is not known. The over-all distribu- 
tion of item validities remained very much the same in the two 
oad as one can see from the summary statistics in footnote 3 
above. 


CONCLUSIONS AND DISCUSSION 


_ With regard to the two problems described earlier, the conclu- 
sions of the study are: 

1) There was only a very slight tendency for the largest item 
Tesponse changes in the direction indicated as desirable by the 
Sign of the validity cdefficient to be associated with items of the 
highest validity. 

2) No association was found between drop in item validity from 
one class to the other and change in mean item response. 

While the results were derived from and apply directly only tothe 
Particular instrument involved, it seems reasonable that they might 
be found characteristic of other similar inventories, self-descri 
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t 
tion forms, and questionnaires in which the items are straight- 
forward and not particularly subtle. Although students applying f 
for college are apparently under some pressure intentionally oF 
unintentionally to give what they consider to be desirable answers — 
to these tests, such changes as occur are related only slightly, if at | 
g all, to he item validities. This suggests that the preponderance of 
student opinion as to which is the favorable end of the scale is 
correct to some extent but that there is great diversity of opinion 
in this respect about all the items. The suggestion from the second | 
conclusion is that those items which are changed most from the 
postadmission to the preadmission situation are not any more ad- , 
versely affected than are those which are changed least. 

There is some comfort in these conclusions for questionnaire and 
inventory constructors, Even though the intent of this kind of 
item usually appears very obvious to the item writer, it does not | 
follow that the student applying for college will necessarily be able f 
to see through it. In his general effort to paint an attractive picture — 
of himself the student seems rather unable, at least in the situs- | 
tion discussed here, to detect consistently the more valid items s0 
that he may ‘put his best foot forward’ on these more important _ 
items especially. The emphases he injects into his questionnaire 
replies have little relationship to the validities of the items. 

Another implication should be mentioned. The findings of the 
study raise the possibility that we need not necessarily discard 4 | 
questionnaire or inventory for use in selection just because (1) 
the items in the instrument are not subtle and because (2) the 
normal item response for an applicant group is not the same as the | 
normal response for an admitted group. Responses to the identical + 
items may be quite different for the two groups, but many of the 
same items may be operative in differentiating among the members 
within each group. Where it is feasible, any instrument that ap- 
pears to measure an important kind of behavior might well bè 
worth trying even though the respondents are seeking a goal and 
might alter their responses accordingly. The empirical eviden’? 
will provide the evaluation required and proper use of the score 
may still produce valuable information for selection purposes. 


SUMMARY 


attitude-interest inventory was administered to two successive | 
PE ey at a women’s college, in one case shortly afte! | 
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the students had been admitted to college and in the other case as 
a part of the application procedure for admission. The average 
response change from the postadmission class to the preadmission 
class was computed for each item and plotted against the item 
validity for predicting freshman-grade performance relative to 


scholastic aptitude» The average item response change vms alsot> Be 


plotted against the change in item validity from one class to the 
other. There was only a very slight tendency for the largest item 
response changes in the direction indicated as desirable by the 
sign of the item validity coefficient to be associated with items of 
highest validity. No association was found between drop in item 
validity from one class to the other and change in mean item re- 


sponse. 
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INFLUENCE OF SOCIAL CLASS UPON 
PERFORMANCE ON THE DRAW-A- 
MAN TEST è 
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Sociologists and cultural anthropologists are stressing the mh 
portance of the location of an individual in the social-class struc 
ture of his community asa basic determiner of a large proportion’ 
of the cultural and environmental experiences he may have (14, 
15). By limiting the environmental experiences in socialization 
processes, a social-class group tends to restrict the individual's 
learning to a particular social pattern of behaving and believing, 4 i 
pattern in many ways distinct from that of persons belonging to 
(or identifying with) other social classes (4, 12). The differences 


} in cultural experience, together with an individual’s cultural moti: 


BERS nt SE 


= ” 


vation, may be expected to influence the content of what a at 
will learn in school, how he will feel about his teacher, his peers 
his school experience in general (8, 13, 17). These cultural differ 
ences may be expected to affect also the responses of children on i 
intelligence tests (2). The problem of this investigation is to ding 
cover the effect of social class membership upon performance 0 i 
T 


‘ 
k 


certain standardized group tests of intelligence and upon perfor 
mance on the Draw-a-Man Test (1). 
Research on the relation of environmental-cultural factors and 


| intelligence has been carried on by many investigators. Their wotk 


is reviewed by Kenneth Eells (5), as well as by Virgil Herrick (1): » 
“The research is close to unanimous,” writes Herrick, “in showing 
that there are significant differences in intelligence-test perfor 
mance of children and youth from different socio-economic back 
grounds, with children from the higher levels always securing th? 
higher intelligence test scores.” (p. 12). 

A number of studies have shown the effect of environment up% | 
the responses of children on the Draw-a-Man Test (8); among the | 
is the research of Havighurst and his associates (9) concerning th? | 
performance of three hundred and twenty-five Indian children % | 
the Sioux, Hopi, Zuni, Zia, Navaho, and Papago tribes in nil? y 

ee They found that Indian children from these trib 
44 
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than white children on the Draw-a-Man Test, that 
ys from the Pueblo groups did better than girls on the 
, the evidence pointing strongly to the conclusion that environ- 
ment affects the performance of children on the Draw-a-Man Test. 
To study the effedt of social class environment on performance of 


white children on the Draw-a-Man Test is the problem œf this®> gq , 


investigation. 
PROCEDURE 


Subjects for the study include one hundred and two boys and 
one hundred and thirty girls whose social status in their midwestern 
community had been determined by using Warner’s Index of Status 
Characteristics (16). Warner’s index is based upon ratings, on 
seven-point scales, of four different status characteristics—ocu- 
pation, source of income, type of house lived in, and dwelling area 
in the community. These four rdtings are combined into a single 
index (ISC) by means of a weighting scheme; in this particular 
investigation, a part of a study of the University of Chicago (6), 
education of the parents was substituted for the source of income. 
The ISC has been shown to be closely related to social class place- 
ment as determined by actual social participation and social repu- 
tation in the community. 

Standardized verbal and nonverbal group tests of intelligence 
(Henmon-Nelson, Otis Alpha Verbal, Otis Alpha Non-Verbal, 
KuhImann-Anderson) were administered to these nine-, ten-, and 
eleven-year-olds in their public and parochial school classrooms. 
The following year the children were asked to “draw the best 
man you can,” and each of these drawings was scored twice on 
the Goodenough scale (7) by two assistants.! 

The comparisons between high and low socio-economic groups 
of boys and girls on the mean IQ’s and their standard deviations 

_ Were made for the five tests mentioned above. Tests of significance 
Were made between status-sex groups on the Draw-a-Man Test. 
Then product-moment correlations were calculated between IQ’s 
on the various tests and the ISC’s. 
ee ` 
1 The aid of Miss Helen Coburn and Mr. David Twain, graduate assis- 


» Department of Psychology, The Pennsylvania State College, is 
gratefully acknowledged. b 
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TABLE 1.—Mean INTELLIGENCE QUOTIENTS AND STANDARD DEVIATIONS j 
ON INTELLIGENCE Tests ror Hiem anp Low Socio-Economic Grours h 
oF 9-, 10-, 11-Ysar OLD Borys anp GIRLS 


High SE Group* Low SE Groupt 


A Wake | i 
o JD » Test Boy Girls y Boys Girls 


N [Mia] SD [N] Mgt] sp |x [Mge] sp | [Mean sp 


Henmon-Nelson 52 /115.0)14. 467/114. 1/16.3]41/94, 2/13. 4|55]98.2)15.2_ 
Otis Alpha Verbal 54 |106.9) 9.4/66]102.8/10.4/43|96.3] 9.9/56/93.8)10.0 4 
id Otis Alpha Non-Ver- | 54 |104.3/10.2/66/105.3) 9.9|42193.2| 8.9|56/94.5/10.3 
bal 
Kuhlmann-Anderson | 53 |107.5| 8.7/69|108.6/10.1/41/93.0 ioaea ala | 
Draw-a-Mant 55 | 81.0/17.6/69] 89. 4/15. 2/43/82. 1/13.6/58/83.2/12.6 
ba Me F 
* ISC’s 5-15 inclusive, f 


t ISC’s 21-27 inclusive. : 

t Calculations were made to determine the statistical significance o 
differences between status-sex groups of mean IQ’s on the Draw-a-Mal — 
test; all three differences involving the high status girls were significant. 


RESULTS AND DISCUSSION 


The results of the comparisons between high and low socio-eco 
nomic groups are shown in Table 1. The high status group consists — 
primarily of upper-middle and upper class boys and girls, and the 
low status group contains lower class children. It will be observed _ 
that both the high socio-economic group and the low socio-ec 
nomic group were further divided into boys and girls. For both 
boys and girls of the high status group mean IQ’s on the Henmot — 
Nelson, Otis Verbal, Otis Non-Verbal, and the Kuhlmann-Andet- ) 
Son are all above 100, the highest being the average IQ for thè / 
boys, 115.0. In contrast, the average 1Q’s for boys and girls of the f 
low-socio-economic group all remain under 100, the highest in this 
case being the girls’ mean Henmon-Nelson IQ 98.2. These results | 
are very similar to those reported by Eells, of course, since oW 
study population was a part of his, He found that the differenc® 
between his status groups, ranging from eight IQ points to twenty 
three IQ points, were statistically significant. Our results are i” 

f terpreted here to mean that clearly there are differences betwee? 
>? | the low and high socio-economic groups as to their performant? » 

these widely-used intelligence tests. Moreover, as commonly 
herved, the differences favor the higher status group. Í 
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With regard to the mean Draw-a-Man IQ’s for all four groups 
compared, they were definitely lower than the others; they ranged 
from 81.0 for the high status boys to 89.4 for the high status girls. 
Just why these average IQ’s should be so low as compared to those 
of the other standardized tests is uncertain. There is, of course, 
the possibility of he pupils’ not being motivated adequately ey < 
enough to score well on the Goodenough; in gaining rapport in 
the classrooms the testers who aided us by administering the tests 
in the larger research project (6) endeavored to put the pupils at 
ease, The result may have been that they relaxed their efforts and 
felt that this was not a test but was designed simply to pass the e 
time. Consequently, the results may be spuriously low. It should 
be pointed out, however, that in some instances the variability of s 
the groups on the Draw-a-Man test, as shown by the standard 
deviations, is larger than on other tests. 

Tests of the significance of differences between mean Draw-a- 
‘Man IQ's revealed that all those involving the high status girls 
were beyond chance expectations, those girls’ average IQ being 
significantly higher than others. In Havighurst’s investigation (9) 
involving the Indian children, a contrasting sample of Midwest 
white children was obtained; within that group there was a slight 
superiority of girls over boys on the Draw-a-Man Test, though the 
difference, they say, may have been due to chance. 

Further average IQ’s on the Draw-a-Man were calculated and 
are shown in Table 2. The difference between the mean IQ of the } 
total high status group and that of the total low status group was/ 
not statistically significant; the difference between the mean 1Q’s| 
of all boys and all girls was significant at the one per cent level. 

Table 3 shows the results of correlation analysis of the IQ scores 
on the various tests with the Index of Status Characteristics. The 


TABLE 2.—Mean Draw-a-Man IQ’s anp SD’s Setecrep GROUPS 


Group N Mean IQ sD 
High S-E group 124 85.6 16.5 
Low 8-E group 101 82.7 13.1 
All boys 98 81.5 15.5 
All girls* 127 86.5 14.5 


S i The difference between mean IQ’s of all boys and all girls was statjs- 
tically significant. b 
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TABLE 3.—Propuct-Moment CORRELATION COEFFICIENTS OF ISC’s wim | 
1Q’s on Various Tests 


Test N rand PE; 


j 222 51 + 
J226 | <42 + 
5 
5 


Henmon-Nelson. . 
» Otis Alpha Verbal..... 
g P otis Alpha Non-Verbal. . 223 
Kuhlmann-Anderson....... seee] 228 fs 
Goodenough Draw-a-Man..............00-.2200005 232 ll + 


22222 


D four coefficients involving the Henmon-Nelson, Otis Verbal, Otis 
Non-Verbal, and the Kuhlmann-Anderson are all moderate, 1's 
ranging from .42 to .53. Each r is statistically significant, being 
larger than four times its probable error. The higher the social 
status, as measured by the ISC, the higher is the intelligence, as 
measured by these IQ tests. This tendency is in conformity with 
other investigations concerning the relationship between intelli- 
gence and socio-economic background. 

The coefficient showing the correlation between the response on 
the Goodenough Draw-a-Man Test and the ISC is .11 with its 
probable error indicating that this small degree of relationship is 
unreliable. In contrast, then, with the previous tests, this shows 
little or no relationship between the Draw-a-Man IQ and socio- 
economic status. Performance on the Goodenough scale does not 
appear to be influenced by the children’s membership in a particu- 
lar social class group. This finding may be substantiated by data — 

+ in a study of ten-year-old white children by Havighurst ant g 

À Janke (10), whose data do not lend themselves to rigid comparison — 
with ours. In the midwestern community they studied, some dif- 
ferences were revealed between the status groups on the Good- 
enough scale. In the small samples they compared on this particulat 
test, the mean IQ for lower-lower class children was 91; for lower 
middle class children the mean IQ was 107, the differenbe between 
these socio-economic groups being 16. 

One of the assumptions which test constructors utilize is that 
all subjects taking the test must be equally familiar or equally uw 
familiar with the test materials. In fact, Goodenough’s selection 
of a man as the object of this test was on the basis that all children 
(presumably all children in American cultures) have had approx 

gy equal opportunities to observe the details of the huma? 
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“figure on which scoring was standardized, so that resulting varia- 
tions could be said to be due to discrepancies among children in 
their capacity to make use of such opportunities, rather than to 
‘their not having had such opportunities. 

A legitimate h}tpothesis might be to the effect, however, that on 


the Draw-a-Man West children of higher socio-economig statugs, ü 


would perform somewhat better than lower class children. This we 
felt might be true because the higher status group would probably 
have had more experiences with drawing materials and with pic- 
tures of people in books and magazines; their relationship with 
school teachers would probably have been more positive, and the 
degree of their motivation to do well on the test higher than that 
"of the lower class children. Moreover, the attention most middle 
and upper class parents give to details of clothing, personal neat- 
ness, ete., might be expected to influence their children’s aware- 
mess of specific details which in turn might increase appreciably 
- their scores on the Goodenough. 
_ This difference in performance by social class groups, we might 
' further hypothesize, would be less than with the other tests ana- 
lyzed because of the almost complete non-verbal nature of the 
Goodenough and its administration, as compared with tests using 
verbal and picture-type items, which have been shown to favor 
often-times the upper status groups. i 
Jt would appear that on the basis of our data, social class en- 
Vironment has not affected to any significant extent the responses 
on the Draw-a-Man Test; but the differences which are apparent 
between the correlation coefficients of the Henmon-Nelson, Otis 
Verbal, Otis Non-Verbal, the Kuhlmann-Anderson, and the Draw- 
aMan do appear to substantiate the latter hypothesis on the 
Telative influence of social class. It is believed that the nature 
of the test materials themselves is the largest contributing factor. 
A Significant difference between performance of low status groups 
‘and higher status groups on the Draw-a-Man was not revealed. 


SUMMARY AND CONCLUSIONS 


_ In this paper we have summarized findings of a study of the 
Performance of some elementary school children on certain widely 
Used intelligence tests. Having obtained their families’ socio-eco- 
nomic status ratings in their midwestern community, by means of 
Warner’s Index of Status Characteristics, we were able to com; 
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| their IQ scores by social class. Further, having administered and p 
_ scored Goodenough Draw-a-Man Tests on these children, we were 

“able to show the relationship which existed between performance l, 
|} onthe Draw-a-Man Test and social status. This sociation proved 
‘to be negligible; the relationships between soci#{ status and 1Q’s 1 
gyn the Henmon-Nelson, Otis Alpha Verbal, Otir/Alpha Non-Verbal, f 

2 { and the Kuhlmann-Anderson were moderate but statistically sig- f 

| nificant. f 
| The widespread use of intelligence tests as selectors of human | 
~ abilities—as instruments to help educational counselors in guiding ' 
D students and to aid industrial and military leaders in selecting | 
personnel—is well-known. We submit that the optimum utilization 
of human resources, actual and potential, in all socio-economic 
groups is an obligation. The search for measures of intelligence 
which resolve social-class differentials must, therefore, be con- 
tinued; questions of motivational factors which influence the per- 
formance of children on tests should receive concerted attention 7 
along with research on the content of tests themselves. f 
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West Lafayette (Indiana) Public Schools 


Recently public school systems have given considerable atten- 
tion to the physical environment of the school classroom. Tra- 
ditional color schemes with their preponderance of dull shades of 
gray and buff have given way to brighter and more cheerful decora- 
tive motifs. Woodwork, floors and furniture have changed from 


' traditional dark oak to lighter shades of sand or bleached finishes. 


Rigid desks in straight rows have been replaced by movable desk 
and chair combinations which can be adjusted to the physical 
requirements of the pupil. Control of both natural and artificial 
light is distributing the quantity of available illumination more 
evenly and increasing its usefulness for each pupil in the class- 
room. 

In the Morton Elementary School, West Lafayette, Indiana, 
a series of research studies was undertaken to determine the effects 
of these new classroom designs upon the welfare of pupils who 
spent a school year in this new environment. The room design 
used followed that of the ‘codrdinated classroom’ recommended 
by Harmon. 

Two classrooms were refinished in accordance with these more 
modern developments. Three major changes were undertaken. 
First, the distribution of light was rearranged so that the quantity 
and quality of light was distributed equally, as nearly as possible, 
over all portions of the room. Care was taken to see that at any 
seat in the room adequate light was available for the critical 
recognition of visual detail ms, fa full and free performance of 
visual tasks, bearing in mind such visual tasks are not confined 
to a single plane but are three dimensional in character. An opti- 
yn distribution of light was obtained by installing semi-trai 
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_ parent baffles at an angle to the upper half of the windows in such 
manner that two thirds of the light entering through the window 
was reflected onto the ceiling and one third was transmitted in the 


normal manner. The baffles were set at such an angle that the 


reflected light figm the ceiling was directed to the normally dark 
sides of the roomMfurthest from the windows. The ceiling and ag, 
- eighteen-inch strip at the top of the walls were painted white to s 
aid in this distribution of light. Natural light was supplemented 
by artificial light also designed to be distributed equally over all 
parts of the room. For this purpose florescent fixtures were installed 
= in reflectors so designed that the light was evenly distributed. e 
| ‘The second consideration in the refinishing was the amount of 
} contrast between the visual task presented to the child and its 
surrounding. It is desirable that this contrast be reduced markedly 
TA jm that which occurs in the normal classroom and preferably be 
Not greater than three to one. Since most schoolroom taks involve 
E gt with white paper, the desks in the experimental room 
oe finished in a light wood finish which reduced markedly the 
Contrast between the paper on which the child was working and 
the surrounding desk tops. For the same reason the finish on the 
a ‘floor was light. Dark woodwork was replaced by light finish and 
‘Pastel shades were used in the painting of the walls. Black chalk- 
“Y boards were replaced by yellow green chalkboards to further 
n teduce undesirable contrast. 
"The third change involved the furniture in the room. It is 
‘desirable that such furniture permit freedom of movement to the 
child and be adjustable both to the physical size of the child and 
to the task at hand, The American Seating Company ten-twenty 
desk was chosen and was used to replace the previous furniture. 
3 ‘These desks were not placed in the customary straight rows but 
" Were arranged in slightly curved rows so that as nearly as possible 
 & fifty degree angle between the plane of the windows and the 
- child’s work was maintained. Such an arrangement minimizes 
"the undesirable shadows which fall on a pupil’s desk. 
Two additional rooms identical to the experimental rooms, as to 
location on the side of the building, number of windows, eto., 
Were chosen as control rooms. Hor each maintenance operation 
| Which was performed in the experimental classroom, & similar 
Operation was performed in the control classroom. When the 
experimental classrooms were painted, the control classrooms 
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also painted but the traditional color scheme was followed. When 
new furniture was provided in the experimental classrooms; new 
furniture, but of a more traditional design, was provided in the 
control classrooms. Similarly, with all other necessary alterations 
in the experimental classrooms, a similar amount of work, material, 


a wd mongy was put into the control classroom As a result the 


control rooms were just as new, just as clean, and just as completely 
refinished as were the experimental rooms. The only difference 


| 
: 


was that the scheme followed in refinishing the control rooms was I 
traditional whereas the scheme followed in refinishing the experi- ; 


mental rooms was that of the codrdinated classroom. 

One experimental room and one control room were designed to 
be occupied by fourth-grade classes. The second experimental room 
and the second control room were designed to be occupied by 
fifth-grade classes. At the beginning of the school year, all students 
enrolling in the fourth grade were assigned at random to either an 
experimental or a control room. Similarly all students enrolling in 
the fifth grade were assigned at random to either an experiméntal or 
a control room. By this procedure, two randomly selected groups 
were obtained at each grade level, one assigned to the experimental 
room and one assigned to a control room. This random selection 
resulted in two groups in each grade which did not differ signifi- 
cantly in educational achievement level or 1Q. 

No attempt was made in this series of experiments to evaluate 
all of the possible effects of the experimental room environment 
upon school children. Similarly no attempt was made to investigate 
all of the hypotheses which have been advanced in support of the 
codrdinated classroom. Rather, three general areas of investigation 
were chosen which, it was thought, were important to the welfare 
of school children. The three areas selected were school achieve- 
ment, posture, and frequency of nervous habits. An experimental 
design was developed in each of these three areas which it was 
felt would throw light upon the question of differences between the 


experimental and control groups. The results of these three experi- 
ments are reported here. 


EDUCATIONAL ACHIEVEMENT 
In investigating the variable of school achievement, scores 02 
the Stanford Achievement tests, intermediate examination and 
tijHenmon-Nelson test of mental ability were used. Only the two 
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- fifth-grade classes could be used for this experiment since the tests 


administered at the fourth-grade level were not comparable. 
These results are therefore based on twenty-three children in the 
fifth-grade experimental class and twenty-four children in the 
fifth-grade cnt classroom. 


For each chil the accomplishment quotient was computes, N 


This quotient evaluates educational achievement in terms of 
both chronological age and mental ability and, therefore, represents 
achievement corrected for the mental ability level of the pupil. 
These quotients were then averaged for each of the two classes 
separately. For the experimental group, the mean accomplish- 
ment quotient was 92.1. For the control group, the mean accom- 
plishment quotient was 85.4. The difference between the two 
groups was 6.7 in favor of the experimental classroom. This dif- 
s was found to be statistically significant with a t-value of 
48, 

. From these results it is seen that the educational achievement 
as measured by the accomplishment quotient was greater in the 
class which spent the school year in the experimental environment 
than in the case of the class which spent the school year in the con- 
trol environment. The difference between these two classes was 
significant at less than the one per cent level of confidence. 


POSTURE 


In investigating postural differences between the two groups 
of children, only one aspect of total posture, head-tilt, was con- 


sidered. A head-tilt is defined as a tendency on the part of the . 


child to hold his head tilted noticeably to either one side or the 
other when he is standing erect and performing a visual task 
which is located at eye level and twenty feet in front of him. 

To measure this variable, a photographic technique was used. 
A 35 mm. camera was mounted on a tripod and placed directly in 
front of the child but below his eye level. Across the room at a 
distance of twenty feet a Snellen visual acuity test chart was 
mounted on the wall at the child’s eye level. The child was brought 
into the photographic room and was shown a circle eighteen inches 
in diameter drawn upon the floor with chalk. The Snellen chart 
across the room was pointed out to him, and he was instructed 
to stand in the circle and read the letters on the chart. No further 
instruction was given as it was desired not to influence theg- 
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dividual’s posture by the instructions. When the child had read 
down the Snellen chart to the point where the letters became 
difficult for him to recognize, a full length front view photograph 
of him was taken. ; 
When these photographs had been developed anM printed, judges 
Were requested to determine whether or not a hedd-tilt was présent 
for each child. The instructions to the judges were to identify 
the midline of the body by observing the sternal notch and a 
point midway between the knees. They were to observe the midline 
of the face by determining a point midway between the eyes and | 
h 


the point of the chin. If the midline of the body and the midline of 
the face coincided, the judge was to indicate that a head-tilt 
was not present. If the two lines formed an angle, he was to indicate 
that a head-tilt was present. 

From an observation of the photographs, it was apparent that 
the measurements represented a bimodal curve. If a head-tilt was 
present, it was present to a marked degree and it was clearly recog- 
nizable. There were extremely few cases in which the head tilted 
so slightly as to leave doubt in the minds of the judges. There 
appeared to be very few cases of slight or doubtful head-tilt. For 
this reason the measurement of these photographs by the method 
of judgment or rating and the establishment of a bi-category vari- 
ahi ani to be justified and was followed in computing further 
results. 

The reliability of the rating method for determining posture 
described above was investigated Statistically. In this investiga- 
tion, two problems arose. The first problem was the reliability 
with which different raters would evaluate the same set of photo- 
graphs. To test this aspect of reliability, two raters were asked to 
judge the same set of photographs. Their results were then com- 
pared using the tetrachoric r as a measure of agreement. This 
procedure resulted in an r of .97. 

The second reliability problem involves the degree to which 
the postural characteristics revealed by the photographs would 
remain constant from day to day. To test this problem, a group of 
children were photographed on one day using the procedure de- 
scribed above. On the following day the same children were photo- 

» graphed again using the same identical procedure. A rater was 
E to judge both sets of photographs according to the procedure 
ined above. The result of his judgment on these two sets of 
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data were then compared, using the tetrachoric r as the measure 
of agreement. This procedure resulted in a coefficient of .81, which 
indicates that the postural characteristics revealed by this photo- 


consequently indigate an habitual posture which persists and 


graphing iy ie relatively constant from day to day and 


tends to recur wheviever the child is performing a task sirsilar to% a 


that represented by the present experimental situation. 

When the reliability of the photographic technique and the 
scoring method had been established, the number of children with 
head-tilt was determined for all groups in both grades and in both 
environments. There were thus a total of forty-nine children who 
had spent one full year in the control environment. Of these con- 
trol children, forty-nine per cent displayed a head-tilt when meas- 
urements were taken in May. There were also forty-nine children 

who had spent one school year in the experimental environment. 


Of these experimental children, only twenty-seven per cent showed 


4 head-tilt when measurements were taken in May. There was 
thus a difference of twenty-two per cent between the groups. This 
difference was statistically significant, the ¢ value being 2.18. It 
‘would appear that the experimental environment has a less detri- 
mental effect upon the habitual posture of the child as indicated 
by the presence of head tilts. 


FREQUENCY OF EXTRANEOUS MOVEMENTS 


Subjective observation of the two types of classroom environ- 
ment involved in these studies had suggested that children in the 
control classrooms might show a higher frequency of extraneous 
Movements or nervous habits. In order to investigate this pos- 


sibility, the technique developed by Olsen was used. This technique , 


consisted of an observation of a timed sample of behavior. Olsen 
had constructed a list of the most common nervous habits dis- 
Played by young children. The technique consisted of observing a 
child for a predetermined period of time and noting for each habit 
in the list, the number of times which it occurred during the ob- 
servation period. 
Using Olsen’s list of nervous habits, similar observations were 
made in the four classrooms involved in the present study. In 
, and again in May, an observer went into the class- 
room. With Olsen’s list before him, he observed the behavior of the 
child in the room for a two minute period. Whenever one 
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the habits on Olsen’s list was observed, a tally mark was made on _ 
the scoring sheet. At the end of the two minutes, the observer 
transferred his attention to the next child whom he observed fora 
similar two-minute period. These observations were continued 
until all of the children in the room had been served. The total 
gnumbes of tally marks on each score sheet Was determined and 

a this figure was considered the raw score on the nervous habits 

inventory for that child. The score thus represented the number of 

times that activities listed by Olsen occurred during a two-minute 
observational period. Each of the four classrooms was treated in 
® the same manner. 

In order to determine the reliability of this technique, two ob- 
Servers were sent into each of two rooms and made observations 
according to the procedure outlined above simultaneously on each 
child in the room. The results of correlating the ratings by each 
of the two raters was a reliability coefficient of .93. 

In order to determine any difference between the two types 
of classrooms, the results were treated separately for the control 
rooms and the experimental rooms. In the control rooms, the 
average frequency of nervous habits in the September observations 
were computed. A similar average was computed for the May ob- 
servations. The difference between these two averages was noted. 
This difference was found to be —.63, indicating that the average 
frequency of nervous habits was slightly higher in May than in 
September for this group. This difference however, was not statis- 
tically significant as the t value was only .95. 

Similar averages were obtained for the experimental groups” 
and the difference computed. The difference in these experimental — 
groups was found to be 1.52, indicating that the frequency of 
nervous habits was less in May than in October. These experi 
mental group differences were found to be statistically significant, 
having a ¢ value of 2.71. 

It is thus seen that the children who spent a school year in the 
control type classroom showed no change in the number of nervous 
habits which they displayed in a two-minute time period. On the 
other hand, the children who spent this school year in the expe! 
mental classroom environment showed a decrease in the numbet 

"i of nervous habits which they displayed in a similar two-minute 
time period. } 
git appeared that the experimental environment was having som? | 
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‘effect upon the number of extraneous movements or nervous habits 
which the children displayed. If it can be argued that these ner- 
yous habits are an indication of the level of tension under which 
the individual is working, then it can be similarly argued that 
this tension is less.in the experimental type classroom than in the 
control type classroom. a oe 


SUMMARY AND CONCLUSION 


This study attempted to evaluate the effect upon fourth- and 
fifth-grade school children of a school year spent in an experi- 
mental classroom. The experimental classroom used was that 
recommended by Harmon and known as the codrdinated classroom. 
A group of children who spent a school year in this type of class- 
toom was compared with a similar group who spent the same 
School year in a traditional type classroom environment. Three 
areas of development were studied: school achievement, posture, 
and frequency of extraneous movements or nervous habits. It 
Was found that the group who spent the school year in the experi- 
mental environment was superior to the group who spent the 
school year in the control environment in school achievement as 
“measured by the accomplishment quotient, in the posture of the 
head in relation to the body (a smaller percentage of head-tilts), 
and in extraneous behavior patterns (a lesser frequency of ex- 
traneous movements or nervous habits). In the areas investigated 
in this study, it is concluded that the newer type classroom en- 
Vironment represented in this case by the codrdinated classroom 
tan have a beneficial effect upon school children who spend a con- 
siderable portion of their time in this environment during the 
School year. 3 
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BOOK REVIEWS 


Roger G. BARKER, BEATRICE A. WRIGHT, LEE MEYERSON, AND) $ 


Morum R. Gonick. Adjustment to Physical Handicap and 
Illness: A. Survey of the Social Psychology of Physique and Dis- 
ability. New York: Social Science Researcl’ Council, (Bulletin 
55) Revised), 1953, pp. 440. 


The social psychology of physique and disability is a problem | 


that has obtained in ever increasing extent the interest of both 
research people and workers in the rehabilitation field. Bulletin 
55, the original volume of the Social Science Research Countil 
which surveyed this topic has influenced the direction as well as 
the amount of interest and work on the subject. This appeared in 
1946. Enough changes have taken place since, in the opinion of the 
investigators, to warrant a revised edition. The pattern of organiza- 
tion, selection and sequence of topic considerations is the same 
as the first. The findings are reported in six chapters with the same 
captions, and a bibliography follows. There are changes, and the 
changes are largely in terms of what the investigators consider t0 
be changing trends calling for a reappraisal to serve as a guide for 
still further research and still more realistic and needed applica 
tions. i; 

The outstanding change in this revised edition is in the shift 
away from emphasis on statistical facts and correlations to a larget 
emphasis on the use of human documents. In the revised edition 
there is a definite improvement in literary style. The revision Ï 
more than merely a repair job. It is that, but it is more. The changes 
made are sometimes just in rephrasing of a word, reconstruction 0 
a phrase, and at times a paragraph. The most marked changes a1? 
in the first two chapters: The introductory chapter on methodolog' 
cal problems of somatopsychological research is described in som? 
detail with a much more substantial perspective for comprehensive 
orientation for understanding all the problems than is done in the 
original volume. Also Chapter 2 concerned with normal variatio® 
in physique is presented in a fuller fashion. To a small extent, such 
differences as appear between the original edition and the present 
revised edition are similar in nature to those found in the tw? 
volumes of Middletown by the Lynds: Middletown full of statisti 
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nd with the omission of human factors and Middletown in Transi- 
beginning the book with the Ball family and describing the 
le behind the scenes. The changes here are not that drastic and 
‘marked, but they.are in the same direction of increased significance 
of research and applications rather than a mere listing and tabula- 


tion. The chief changes are in a broader comprehensive oriextation®, 


in defining the field both in increased clarity of concepts used and 
an increased searching for theories of soma-psyche relations, so 
there’s a more frequent searching for and explanation of likely 


explanatory hypotheses. Also revealed in the more recent studies 
is improved technical adequacy of some of the research. These 
‘changes for the better are stressed and influence the revised edi- 
tion, But they also do one other thing—they bring to the focus 
ttention some of the urgent needs that still remain in the field. 
thing is considered related to the problem they are con- 
d with that, in their way of defining it, is “Concerned with 
variations in physique that affect the psychological situation 
person by influencing the effectiveness of his body as a tool 
ons or by serving as a stimulus to himself or others.” The 
st of physical variations which can have significance as listed by 
em are eighteen in number and include size, race, age, Sex, 
cosmetic defect, muscular strength, motor ability, speech defects, 
- Auditory impairments, visual impairments, tuberculosis, heart 
~ disease, diabetes mellitus, rheumatism, leprosy, cancer, orthopedic 
ilities, acute illness. Also included in their consideration are 
some psychoanalytic contributions such as Schilders in considera- 
| tion of the body image in relationship to the self-concept. And 
__ the investigators call attention to one practical phase of the prob- 
_ lem; namely, the urgent need for investigating the effectiveness 
of forms of coping with the problems of disabled workers. 
iy All in all, the investigators made a competent survey of the 
literature in their first effort, and the people who are in the field, 
either in the field of rehabilitating and educating the disabled or 
Social scientists interested in the ‘followup of studies of social 
logy of physique and disability, will find this volume an 
More satisfactory frame of reference for orientation in the 
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Manson B. Smer. Intergroup Education and the America| 
College. New York: Bureau of Publications, Teachers College, 
Columbia University, 1952, pp. 212, $4.25. 


9 
The foundation for this latest volume of the Teachers College | 
p eap8tudien in Education is provided by a detailed and intelligent | 
analysis of the intergroup education programs reported by fifty-six 
colleges and universities in response to questionnaires submitted 
by the author. Each chapter presents the philosophy’ of a certain 
aspect of intergroup education, a summary of policies and practices |” 
of the fifty-six institutions in regard to this aspect, and recom: | 
mendations for planning a curriculum to deal with this aspect. £ 
The organization should be convenient for individual college in- 
structors or committees charged with the task of working out a 
curriculum concerned with desirable attitudes, values and skills 
in intergroup relations. To such committees and teachers the book 
can be recommended. The emphasis on curriculum development 
rather than methodology makes it of limited value to most workers 
in the social sciences. 

Neither the author’s thoroughness nor her command of the sub- 
ject and related areas will be challenged by many readers of this 
volume. Profuse quotations and citations of authority plus a com- 
pletely-thought-through concept of intergroup education leave 0° 
doubt of the writer’s familiarity with the subject. However, this 
style of writing is probably also the major weakness of the volume. 
Perhaps the author felt the necessity to reinforce her arguments 
with the statements of authorities, because she is aware that somè 
college teachers might not see the need for intergroup education. 
Whatever the reason, the style makes for low readability from the 
standpoints of both interest and difficulty, and may discourag? 
some readers, 

University of Oklahoma Wirtam ELLER 


J Burnarp H. Haut, Mary Gancemt, V. L. Norris, Vivinnn? 
Hurones Vat AND Gorpon Sawarsxy. Psychiatric Aide 
Education. New York: Grune and Stratton, 1952, pp. 168. 


P. sychiatric Aide Education is a report of a sevëfi-year experiment 
) ofa training program carried out by the Menninger Foundatio? 
@) coöperation with the Topeka State Hospital. A grant from th? 
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Rockefeller Foundation supplied the necessary funds. The school 

opened officially on October 3, 1949, and closed on June 30, 1952. 

‘A total of eighty-five students were matriculated; sixty of them 
graduated. The story of this project includes a report concerning 

the field of psychiatric aide—its growth and its needs; recruitment 

and selection of students; an evaluation of the curriculum; s 

story concerning its graduates; and an evaluative chapter on recent 2S 
developments in psychiatric aide education. Also included is a 

fair sized appendix, which contains a bibliography and forms used 

or relevant in considering the problem of psychiatric aide educa- 

tion, The forms include application for admission, course outlines, d 
and course evaluations. 

“Allin all, it’s an interesting story written by the staff members. 

Dr. Karl Menninger in his preface describes the significance of this 

project as well as his reaction to the obstacles experienced because 

of prejudice and blind spots in unexpected places. This story should 

be of interest to hospital administrators and those concerned with 

the problem of psychiatric aide education. 

Psychological Service Center H. MELTZER 


“Bown Forges Tarr. Textbook of Refraction. Philadelphia: W. B. 
Saunders Company, 1951, pp. 418. 


This book on refraction by a professor of ophthalmology at 
‘Temple University is primarily intended for a student group pre- 
paring for the practice in the field of eye work. Accordingly, the 
author has sensibily enough selected to write the book in straight- 
forward, at times seemingly dogmatic form, using references that 

are pertinent and readily available. Also included by him are 
; and papers suggested as further reading on the more original 

Material that can be used by more ambitious students or the more 

Curious and inquiring readers. 

_ The text as written presupposes a background of fair knowledge 
of the anatomy and physiology of the eyes and the nervous system 

moe Well as an acquaintance with the principal methods of geometric 
and physiological optics. The early chapters contain discussions 
of the basic material necessary for the study of refractive error 
and ocular motom#inomalies. They also contain a short, elementary € 
Presentation of the more important concepts which are discussed «h 
M greater detail later in the volume. There is a chapter on visi@y 
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that continues with a consideration of preliminary concepts for | 
understanding problems in the field and then chapters are outlined 
that are of anatomical structure or of functions related to the field, 
Illustrative of tOpics included are: ocular reflexgs; anomalies and 
, variations of ocular accommodation; myopia; astigmia; presbyopia 
¿pand aphakia; binocular fusion, and principles and methods in 

heterophoria. 

In all, this is a carefully prepared volume that should serve well | 

* to do what it proposes to; namely, serve students in the field as | 

well as neurologists and psychologists who are interested in having } 

an authoritative volume on ocular refraction. 
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EDUCATIONAL THERAPY y = 
BRUCE McCARTY 


“ Waco, Texas 


This is a report on the Experimental Educational Therapy pro- 
gram which was conducted at the Austin State Hospital from 
2 March to the end of August of 1951. It was cosponsored by the 

Hospital and the Hogg Foundation for Mental Health. 


RATIONALE 


Mental institutions have been thought of historically as places 
"Where unfortunates were kept away from civilization for their 
" mutual protection. Not until comparatively recently has it been 
“thought possible to restore mental patients to a healthy active 
7 tle in society, and with the dawn of this concept, the name of the 
"institution was changed from ‘asylum’ to ‘hospital’. Sad to relate, 
i the practice has not kept pace with the theory. In many instances 
. the professed intention of relieving mental illness and restoring the 
individual has taken a back seat to administrative convenience 
which is often suffused with the file-and-forget philosophy. More 
often than not, various therapies which are in use are merely 
peripheral measures designed more to occupy time than to amelio- 
tate aberrances. 

Pi It Seems likely that even though the etiology of mental disorders 
18 primarily emotional, the patient’s preoccupation with these per- 
_ sonality problems interferes with his intellectual accbmplishments 
and this in turn causes him to fall behind in his own rate of develop- 
ment—in what Havighurst calls ‘developmental tasks’. Though 
these are frequently thought of as being problems of a strictly 
academic nature, such is only part of the story; sundry items such 
fa the ‘Acquisitions of graceful interpersonal relationships, advance- 
Ment in current activities (whether educational, occupational ‘or 
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personal in nature) and Ta, MAM keeping up with the-here- 
and-the-now are no less developmental tasks which must be mas- 
tered in their proper sequence to form stepping stones toward 
future life manipulation. The befoulment of this grderly sequence 
of developmental progress in turn causes an emotional problem of 

@ «8 own.»As the individual sees himself being left behind by the 
onrushing world his urge for inferiority-avoidance reaches near 
panic proportions and the vicious cycle thus created is not mate- 
rially relieved by therapies designed to waste yet more time. 

Clearly, if these two facets of the problem are simultaneously 

> operative they must be attacked simultaneously in order to render 
therapy effective. There must be (1) the traditional psycho-envi- 
ronmental attack on the emotional problem: environmental in 
order to release the patient from the tension causing rigidity of his 
old milieu, and psychotherapeutic to rebuild an ego substantial 
enough to withstand the stresses and strains of life. 

Concurrently, there must be provided (2) the opportunity to 

bring expectations and accomplishments into line. It is hard to 
see how there could be any successful and permanent rebuilding 
of the ego with only the materials which were present from the 
incipient stages of the morbid condition. These were evidently 
inadequate as a bulwark once, and there is little reason to suppose 
them adequate as time and tide go on apace. The addition of tools 
for living is essential, whether academic, professional and/or avo- 


cational; and in its widest sense we call this increment Educational 
Therapy. $ 


CLASS PLAN 


The idea of Educational Therapy for the younger patients of the 
Texas State Hospital was primarily a pilot study for assessing the 
effects of education as a means of therapy for the students, and 
also the effect of these abnormal students on education. 

‘The curriculum was academic- rather than vocational-centered. 
Of course this does not mean that it was built exclusively around 
the three ‘R’s’. A typing program, for instance, radio course and 
art classes were pointed in the direction of specific trades, but the 
emphasis was on the traditional subjects. 

The school was set up in the former office of the beauty sho?: 
‘ Toom was particularly well suited for the purpose in that it 
/ jy» "ght, modern-looking, well ventilated and out of the heavy 
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_ The walls were a combination of stained wood and 
the latter part of which was used as a movie screen. One of 
long tables contained our book collection and three archaic 
ono of which was in such condition that it served only 
capacity of a bookend). The other table was work space. 
arithmetic, art, astronomy, chemistry, civics? Englis} 
, geometry, health, history, physics, radio, spelling and 
etry were provided by the Special Education section of 
Public School System. Other equipment included water 
crayons, white and colored chalk, construction paper, tab- 
ing paper, molding clay, pencils, finger paints, magazines 
ipbook making material. 


LIMITATIONS 


jo often a mental institution is a catchall for everything 
by deems undesirable enough to segregate from itself. In order 
a hospital an instrument of rehabilitation some degree of 
tion must be maintained. If it is necessary, because of 
ies in the legal system, for the hospital to cater to the 
s as well as the abnormals, the hospital must make a 
differentiation between the programs to be followed in treat- 
of these two categories. The purpose of Educational Therapy 
ams is to restore the patient at least to his former level of 
ncy, and, if possible, to increase his level of functioning; but 
be recognized that no educational technique invented has 
ced a missing gray cell. By trying to make one program 
enough to include every form of deviation the net result 


is not intended to relegate the subnormals to the waste- 
t merely segregates them from ‘the group under discussion. 
normals form a group which is custodial in nature, and 
clusion in a dynamic program for readmission to normal 
ill lead only to a blind alley. 
idea of having the school open to voluntary attendance by 
all the hospital patients was advanced but seemed im- 
on at least six counts. (1) The nature of the students 
3 it imperative that the classes be small so that individual 
can be given to the students. On several occasions this 
borne out substantially when it was found that two stu- 
not be given simultaneous instruction on p: 
q 
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assignments. This is not strange in that, perhaps, those who be- 
came students in this school were those who were unable to function 
adequately in their society, thus it is small wonder that they find 
group participation difficult—even on this hmited scale. 

_ (2) Another difficulty which is closely related to the above is 

e defiriite amount of work which an assignment can contain for 
each individual. If any assignment exceeds this amount it is re- 
jected as being too difficult, and the student feels ‘put upon’. If 
the assignment contains less than the optimum it is within easy 
reach of the student which creates for the teacher the problem of 
keeping everyone occupied. With a large class the teacher cannot 
get around to all before they have finished such assignments. 

(8) These students are more insistent than the normal students 
about having their work checked and graded immediately. They 
lack courtesy and tact and will interrupt any activity in progress to 
secure their most recent desire. A large class of this type would 
soon deteriorate into a condition of bedlam. 

(4) Another practical disadvantage to having ‘open’ class is the 
disquieting effect (to say the least) created when the teacher walks 
into a room filled with students about whom he knows nothing 
except that they must be abnormal or they wouldn’t be there. Not 
knowing their educational achievements or their idiosyncrasies the 
eo is at a loss to know how and where to start and the direction 
to take. 

Tf the teacher starts a paranoid too low his pride is injured and 
rapport is extremely difficult to establish subsequently. The same 
amount of coercion-cajoling that can be used successfully with a 
schizoid meets with disaster if used on a hostile organic student. 

A positive program of admission to the school which was worked 
out and adhered to circumvented many of these unfortunate situs- 
tions. Opportunity for the teacher to brief himself on each cas? 
before the student was admitted to the classroom saved confusion 
and embarrassment in all directions. 

(5) Equipment, books especially, are taken to the wards at will 
because it is felt that this permission gives the students a sense of 
belonging and being accepted. The free flow of students, howevels 
makes this arrangement obviously inadvisable. 

(6) Several of the students are interested in making the class 2 
meeting place for themselves and their objets d’ amour, and little 
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or no academic ambition enters the picture. They easily become a, 
disrupting influence for, if they are put to work, they take the 
limited facilities from those who have genuine academic aspira- 


tions, and if they are not put to work, they undermine the entire 


enor of the class with their erotic activities. 
Thus it was felt to be in the best interests to limit the,studentgy, 
‘in the class, and to this effect a screening criterion was set up. It 
might be noted that as things turned out this standard was fre- 
quently honored in the breach. Four categories of patients were set 
aside as being inadvisable: (1) those of advanced age. These were 
sufficiently removed from school by virtue of their years that it 
was felt their inclusion in school would tend to be a regressing 
factor rather than a normalizing factor. In the case of the younger 
patients, even though their academic progress might be slight by 
‘comparison to outside standards, it was felt that for them to en- 
age in an activity which resembled, at least to some degree, their 
concept of normal outside work would be in some measure thera- 
| (2) Those currently engaged in a course of shock treatment were 
eliminated also. The advisability of this was subsequently under- 
lined by the exception. One girl who was engaged in shock therapy 
disrupted classes thoroughly; it was impossible to arrest her atten- 
tion even for short intervals, she could remember none of the names 

-of the classmates (even though her avowed true love was among 
them) and in general she macerated academic progress. 

(8) Those with organic involvement (brain damage), especially 
of the progressive types, were placed on the nonpreferred list be- 
Cause it was thought that their inclusion with others of greater 
Capacity would be beyond the point of diminishing returns with 
respect to their administration. There were three, however, who 
Were admitted to the class: one trauma and two lobotomies. The 
trauma case did little of academic interest and was much more 
Concerned with sex relations than arithmetic. One lobotomy made 
Great strides toward regaining proficiency in geometry, but the 
other was on shock treatment also and the summative effect was 
ea was considerably beyond the pale of advisability for our 


(4) The psychopathic personalities were frowned upon mostly 
of their anti-social activities. When all is said and done, 
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it is still necessary to maintain some sort of order in the classroom, 
To maximize the effectiveness of the program, it was thought ad- 
visable to minimize the number of psychopaths. 


AIMS AND PURPOSES 


@g/Educagion as therapy upholds both the principles of modem 
education and the tenets of good mental health; for both are based 
on the same concept—capitalization on individual differences—but 
neither science escapes modification by the union. Our purpose is 
not purely education. If it were this experimental project would 
be unnecessary; any typical classroom situation would suffice. 
Our students were frankly abnormal—beyond the limits of toler- 
ance in normal classrooms; so it seems advisable to define the aims 
and purposes which guided the experimental class. 

1) To provide basic academic skills for the student to use to facilitate 

his future adjustment. 


For anyone to succeed in this day as even a somewhat autono- 
mous creature, he must have some control over his environment. 
Tt is held to be almost patent that one who has attained a chrono- 
logical development far in excess of one’s academic progress is at 
a distinct disadvantage in one’s dealings with inanimate objects 
and especially with interpersonal relations. If there is to be any 
degree of rehabilitation for these young students, it will necessarily 
follow that they compensate in some measure for their scholastic 
deficiency incurred as a result of their mental illness. 

The likelihood of their acceptance into and their employment 
by society progressively declines with retardation in academic 
Progress. Who, in his right mind, would employ a patient for any 
sort of desirable position if the patient could not read or write or 
tell time? If the patient is destined for nothing more than custodial 
care for life, these things become of trifling importance for any 
environment which is equipped to care for him is equipped to make 
allowances for his inabilities. If he is to become a responsible citi- 
zen, he must meet the basic standards. 

Intellectual performance by an individual suffering from 4 
mental condition is almost always impaired, and the encroachment 
is usually spotty and uneven. The students themselves are unaware 
of their deficiencies; so it must be up to the teacher to ascertain 


) 25 lacks (and some of these show up only after considerable time) 
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ide, basic personal skills for the student to use to facilitate 
future adjustment. 

s that one of the first things to go when an individual 

y mental illness is his will to make himself a part of ce 
‘and neatness. He lets his clothes become dirty, dis- 
tagged; his hair and teeth go unmolested by comb and 
school work and letter-writing go to seed; and water 
little concern to him since a great portion of that which 


a compulsion toward neatness which distinguishes our 
Jass from almost every other subculture on earth. From 
nt of view of the world outside the hospital a degree of 
neatness not commonly found among hospital populations 
a prerequisite to satisfactory adjustment. 

copnized that a superficial veneer of tidiness will do little 
d amelioration of the basic disorder, but it is felt that gentle 
g in that direction, setting examples for the students and the 
ognition of neatness as a symptom of normalcy will do 
toward reinstating the student’s self-respect when 
ore pressing problems are resolved. 
to be considered also the hygienic value of cleanliness 
concomitant morbid mental state that frequently goes with, 
ated by, a microgenic disorder; but these are so com- 

pted that they need only be mentioned. 


vide accredited academic work on primary and secondary 


Jass was fortunate in having the coöperation of the Austin 
hool System in the accrediting of work done in school; 
H ', per se, is important to the student as well as to others. 
student it provides some tangible goal for him to strive 
milestone which he can use to signify a definite level 
nt to satisfy himself that he is not merely vegetating 
s stay in the hospital. A ý 
establishments which would be prospective employers of 


s have no program for testing the proficiency of & 
e 
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students along these lines; so the symbol of proficiency, the di- 
ploma, frequently becomes the universal, critical key to the future, 


4) To provide information which will serve as a kasis for their dis- 
crimination in introspection. 


> Be 1, has*been noticed that some of the students realize that they 
have had hallucinations, and the only basis for discrimination be- 
tween facts and artifacts is their fund of information. The realiza- 
tion that tinnitus is a universal occurrence and not an indication 
of malady came as a startling revelation to one student. Such un- 
derstanding of natural phenomenon and the significance of intro- 
spective observations will go a long way toward minimizing 
misinterpretation and should increase the student’s grip on reality 
as he begins to gain some insight into himself. 


5) To occupy the student constructively. 


Tt has been said, “No wonder epileptics deteriorate: they have 
nothing to do but sit around all day and watch each other throw 
fits.” In the same vein, residents of mental hospitals who have 
nothing to do but contemplate their symptoms and vegetate in the | 
behavioral fertilizer of other psychotics can hardly be expected to 
derive much benefit from their contacts. Educational therapy is 
designed, at best, to provide goals and incentives to reach those 
goals—at least, to give the students something to keep them from 
perseverating on their own morbidity. 


6) To provide activities and entertainment which are designed to keep 
the student in contact with his concept of normalcy. 


It has been noted that some of the students who gain a bit of 
insight into their conditions identify themselves with the abnormal. 
For these students to reach a high degree of normal functioning 
while isolated in the hospital requires a bootstrap sort of effort. 
They must identify with normalcy without being exposed to it. 
The process of deterioration may be nothing more than an internal- 
ization of the lax and permissive attitudes of the institution. 
Abandoning hope for release and realizing that he need have 20 
rigid standards to continue life in the hospital, the student gives 
up the struggle. 

Educational therapy is designed to bridge the -gap betwee? 
BMnaley with its intolerance and hospital permissiveness. It 
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brings the ‘outside’, non-threateningly, to the student for his ac- 
ceptance. 
7) To provide opportunities for social interaction. 
Tn addition to identifying personally with normalcy an important 
goal is that of identifying with society as a collection of ifidividums. 
_ The pseudotolerance which is often exhibited by mental patients 
toward the behavioral peculiarities of others is not based on ac- 
“ceptance of them, but rather on a wide separation of their respec- 
tive inner worlds. When these separate worlds of action are brought 
into juxtaposition, it is usually observed that they react in a char- 
acteristic short-sighted, egocentric manner. 
If the students could be brought to some degree of realization 
that one can make allowances for others without loss of status all 
~ would be well; but if this is impossible, it might still serve a valu- 
F able purpose if one were taught merely to inhibit one’s aggressions, 
thus obviating the vicious cycle of threat and ego defense. It was 
found possible to use the rapport which was established to circum- 
Vent some threatening unpleasantness. It is felt that as esprit de 
_ orps develops that social pressures will come to mean more in 
y shaping their activities. 

Tn order to develop this feeling, activities which set the students 

apart from the rest of the hospital population were planned. They 

included movies on a wide variety of subjects, student-selected 
field trips such as the museum and Barton Springs (one student 
caused a furor when he announced to the life guard that he was 
nine feet tall, weighed three hundred and ten pounds and was the 
best looking man on the grounds). 

There is also in class an opportunity for students of opposite 
sexes to meet in supervised but permissive atmosphere. Such con- 
stitutes experience which should be in the repertoire of the student 

he reënters society. 


8) To provide opportunities for success. 


Tis so important that the students be made to feel successful in 

their activities that rather than being merely a technique for 
other aims, this is in itself a goal. There is an effort made 

PN to assign any task which is beyond the realm of attainment 
or the student. He is given no deadlines for its completion and 
much commendation is rendered when it is finished. If it idly 
e 
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partly correct, it is considered correct with an explanation of the 
alterations necessary. If it is incorrect in its entirety, no blame or 
censure or disapproval is indicated, especially when the student 
seems to be chastising himself, and encouragement And reassurance 
is always abundant. i 

rades”are optional with the student. Some prefer them and 
others do not, but in all cases a pittance of work nets a superlative 
grade. Seldom are they less than 95, and when work is corrected 
only the second mark, 100 is given. 

Tt has been noticed that these students frequently prefer dull, 
routine drill. This seems to be a striving for adequacy and security, 
and it must be catered to before the desire on the part of others 
for a greater academic achievement. It is hoped that when they 
feel a genuine security the students will show spontaneous ten- 
dencies toward exploration of unknown fields and diversification. 


9) To provide an accepting atmosphere. 


An accepting atmosphere is indispensable to rapport, to a feeling 
of non-threat, and hence to a facile learning situation. The key to 
this seems to be in quiet, interested, unemotional receiving of any 
novel and startling innovation in student behavior. All the eye- 
brow-raising details are greeted without panic, and they are referred 
to thereafter only by the students. Quite naturally, in this popula- 
tion there is a limit beyond which aggression and negativism 
cannot be tolerated, but, happily, these limits are seldom reached. 

On ‘those days’ students are allowed to read comics, just sit and 
stare, return to the wards, sleep under the tables or whatever they 
feel like doing. They are asked frequently if they have suggestions 
and practically any spontaneous activity is included in the cur- 
riculum. 

Competition, whether it be between the students or between 
school and students, is not sponsored. Each student works on his 
particular assignment and is paced by his own ability rather than 
any frantic sense of academic inferiority-avoidance which is 5° 
frequently used to motivate normal students. 

To increase the atmosphere of the school and diminish the hos- 
pital influence the students are referred to as such and never 28 
patients. 

Asi the students seemed to get nowhere and some used this 
a 
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as a loafing device throughout, but most seemed to 
limitations would allow. 


data which may shed light on these conditions, their 
on, education and perhaps their prevention. 


| the terature on the education of exceptional individ 

ns to be a paucity concerning mental disorders, perhaps 
k of contact with truly abnormal people in the average 
environment. Hypotheses which have been made and 
therapy has been based can be evaluated only by com- 


ied a daily log of events and attitudes, a collection of 
and other miscellaneous items: deposits for the future. 


z RESULTS 


met on one hundred days from April through August, 
enrolled eleven students whose attendance ranged from 
Aa days. One boy who attended class only one day 
ted. 
progress was generally better than had been antici- 
The most outstanding advance was made in typing. The 
progressed from an incomplete knowledge of the keyboard 
) aphs of mistake-free transcription done rhythmically 
romise of speed. 
showed considerably less improvement. One had hope of 
the Arabic numeral system from 1 to 12 preparatory to 
o tell time. He managed nine numbers in only one hun- 
hich was in reality an accomplishment. 
dents made notable progress, either in new subjects or 
g departed abilities. One who showed great promise 
m and mothering confines after only four days of at- 


ers showed somewhat less progress, two of these were 

: ies, one was on shock treatment, and one was the brain 
è case. The two aments failed to grow in academic grace, but 

in shrink either. 

sonality progress of these students is harder to ascertain. 

ive appraisal would necessitate frequent observations by 
for that purpose, but lacking these facilities, the only 

tive was to base the progress reports on the daily “es. 


e 


ded from this and comparable classes; so for this our _ 
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in the school. Five made strides toward gaining self-confidence, 
socialization and development of self-control. The two who were 
aments were never a personality problem. The organic case was 
mostly uninfluenced—as might be supposed. Three showed no 
improvement, but, happily, no deterioration either. 


zi FUTURE CLASSES 


One group omitted from the experimental program would seem 
to merit special attention. Here in the South Negroes generally 
maintain an average intellectual achievement lower than that of 
whites, It would often be an unrewarding task to try to bring the 
educational average of the Negroes up to the level of the whites, 
for if a Negro were returned to his peer group with an educational 
level which was very far in excess of the group he probably would 
find acceptance by even his own group more difficult—and accept- 
ance by the whites still impossible—thus aggravating the very 
problem which is crying for solution. 

Luckily, the absolute level of education is of much less impor- 
tance than the relative level. If he returned with a new ability— 
perhaps the ability to read where before he was illiterate—he 
would be in a position to command increased respect from his 
associates and still be accepted. 

Tt would seem clear that the ultimate goals of each case would 
be based on a thorough study of the case itself and how it articu- 
lates with its environment, rather than a remote theoretical cri- 
terion. 

LIFE ACTIVITIES 


The feeling of accomplishment for anyone might also go a long 
way toward counteracting the stigma of being a mental patient 
which is all too often still a part of present-day thinking. The 
patient who is forced to admit, even though only to himself, that 
he is handicapped in his living because of losses due to his mental 
illness, is placed at a distinct disadvantage; but if he could say; 
“Look! I can perform better because of my stay in the hospital,” 
he would be better able to face the readjustment to society and 
the acceptance of his own illness, 

Many things which the non-institutionalized person takes for 
granted could be the source of acute embarrassment to one who 
has been out of contact with society for some time. For example, 
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what embarrassment succeeds a frustrating encounter with the 
ubiquitous coin machine which even the little children can operate? 
Likewise the cataclysms which could be caused by the eye operated 
doors, help-youtself supermarkets, late model automobiles, multi- 
directional traffic signals, express busses, self-operated elevators, 
perhaps even dial telephones and the other myriads ôf gad, 
which characterize our age. 
i { For one engaged in the herculean task of readjusting to a life that 
. has passed him by, many such problems might be critical in caus- 
ing existence either to be smooth and flowing or a succession of 
| accute discomforts. i 
Toward the end of his stay classes would include ‘Living’ which 
stresses smooth manipulation of common situations. The inten- 
sity of this phase would depend on the total time he has been effec- 
tively removed from society, the state he had reached in his pre- 
Í morbid social development and social expectancy. 


ORGANIZATION 


The inclusion of Educational Therapy would not eliminate such 
therapies as Occupational, Recreational and Music, nor would it 
merely superimpose another one of these on the existing structure. 
Rather it would integrate them into a single, coördinated, multi- 
directional therapy for a concentrated attack on the illness. Edu- 

| cational Therapy should include all these and anything else that 
j would tend to increase the individual’s ability to perform success- 
fully in life. 

Because of the variety of backgrounds and the differences in 
goals of those patients admitted to the Educational program it 
j cannot take on a linear form such as is found in ordinary schools. 

These assume that everyone begins at the same level and ends at 
the same level. Here, however, an areal pattern would obtain 
which would allow one to start at his own level and work upward 
through his stay in the hospital toward individual goals. 

Upon entry into the hospital the usual legal, medical and psy- 
chological processing would take’place as rapidly as possible. Part 
of th processing would include information concerning his aca- 

emic past to assist in his placement. Also included would be items 

4 which would shed light on his premorbid educational expectations 
to be used as aids in fixing his goals, and they would be nine 
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also by his mental capacity, his degree of adjustment, rate of im- 
provement and social expectancy. 

His first classes would be simple, small, informal, permissive 
and homogenous groups (perhaps cleanliness only to accompany 
shock treatment). His success would be the desideratum. Classes 
@Buld inelude as much variety as possible and still be within easy 
reach of the student. Their periods should be short, not more than 
thirty minutes, to accommodate their restricted attention spans, 
Periodically some physical activity must be provided to assuage 
the lethargy in the withdrawn ones and to give release to the hy- 
peractive ones; also it has been observed that movies of almost 
any kind have a general appeal, almost magical, and good results 
have been experienced with having refreshments (cookies, pop, 
graham crackers, fruit juices, etc.) at odd times during the day. 
One half day of classes should be considered a full load at the be- 
ginning and as a student’s disturbance abates there would be a 
corresponding increase in the academic load. Students in the ex- 
perimental class responded well to the idea of having free access 
to the equipment and being placed on their honor as to its use. 

Although permissiveness is the key to the first courses, it must 
go hand-in-hand with persuasion and a great deal of encourage- 
ment. It must be remembered that, for the patient, this is not his 

| major battle; so he may be expected to lack a great deal of en- 
thusiasm for school. These first classes must be limited to six ot 
eight, and even fewer if they are severely disturbed or highly 
maniac. 

As the student progresses and is better able to make successful 
adjustment to additional responsibilities, he will be promoted to 4 
larger class with more exacting standards. This graded permissiv- 
ness insures the warmth necessary to the beginner, and gradually 
‘toughens’ him so that he will not be overwhelmed upon release 
from the hospital. 

Concurrently, of course, the psycho-environmental and medical 
aspects would be operative while treatment of his milieu would 
be progressing behind the scene, Medicine has long believed that 
medication followed by reinfection was effort wasted. Similarly 
here. The social services would have as one of it’s chief functions 
either reéducation of the patient’s former environment to a point 
where it no longer constituted a threat to him, or placing the stu- 


Wy OSS where he can bé accepted. Graduation would be synonymous 
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ze from the hospital, and a diploma should indicate 


TEACHER 


s for this program will constitute the major problem. 
wor is ever to be successful, it must be uaderta® 
hich includes an adequate student/therapist ratio. 
rs for such a program would be well-grounded in psy- 
d the basic concepts of psychotherapy. When all is said 

still an organization devoted to the restoration of 
th; education here is merely the key to this restoration, 
z should obscure this salient fact. The teacher should 
ed with educational procedures, especially those which 
the education of exceptional persons. 
o factors alone are not enough. Especially for the first 
therapist must have a warm, easy-going, ‘mushy’ per- 
Which will allow him to identify easily with his students 
‘with him. Students respond much better to a bit of hori- 

st than to a business like, rigorous, discipline-based 
They seem to accomplish more in less time when allowed 
me time in ventilating their feelings in class. 


SUMMARY 


report on an experimental class in Education Therapy 
as conducted at the Austin State Hospital in 1951. The 
e program were (1) to shorten the stay in the hospital of 
temissable, (2) to provide a stable viaduct back to 
‘tating readjustment, (3) thereby reducing the num- 
ions and (4) eventually reducing the over-all num- 
patients in hospitals. 

tool in the task of preparation and readjustment would 
ated educational program. The aims and purposes of 
situation in particular, and perhaps for Educational 
‘general are: 
ide basic academic skills for the student to use to 
future adjustment. 
rovide basic personal skills for the student to use to 
è his future adjustment. 

vide accredited academic work for primary and sec- 


i a 
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4) To provide information which will serve as a basis for their 
discrimination in introspection. 

5) To occupy the students constructively. 

6) To provide activities and entertainment wbich are designed 
to keep the student in contact with his concept of normalcy. 
<A) To provide opportunities for social interaction. 

8) To provide opportunities for success. 

9) To provide an accepting atmosphere. 

10) To collect data which may shed light on these conditions, 
their amelioration, education and perhaps their prevention. 


_ WHO IS PENALIZED BY THE PENALTY 
FOR GUESSING? 


x 


' ALEX O. SHERRIFFS and DONALD S. BOOMER 


University of California a Ay © 


Almost since the introduction of objective tests into the field 
‘of educational measurement, teachers have been concerned with 
the guessing factor as it enters into test scores. Correction formulae 
of varying degrees of complexity have been developed. All are 
variations of the original method of subtracting the number wrong 
from the number right, ignoring omitted items, and considering 
the remainder as the significant measure. 

Generally it has been shown that the application of correction 
formulae results in a wider range of test scores and in downward 
Adjustment of average scores. Some have claimed that chance 
effects are reduced and that a more reliable set of scores results (5). 
Others, however, believe that a correction formula lowers relia- 
bility (4, 6). Ruch (7), after surveying the literature to 1929, 
' toneludes that the effect of a correction term on reliability is 
debatable. . 

Allin all, though the effects of the theoretical corrections for 
chance are completely unclear, such corrections are applied by 
many teachers in the classroom situation. One extreme defender of 
the penalty for guessing aims his correction at the ‘gamblers’, 
‘Dlufiers’, and ‘bullies’ among the students, stating that “The 
Tesults (of penalizing wrong answers) are always the same. Stu- 
dents gradually acquire a new regard for truth, precision of learn- 
ing, the impropriety of guessing, the shame of bluffing, et cetera” (8). 

the present writers have for some time believed that the usual 
notions about the effects of right-minus-wrong scoring have ignored 
r r factors. Our own reasoning has been somewhat 


A course examination is administered in order to evaluate the 
Present ‘achievement’ of students—their knowledge of course 
content, or their ability to utilize their course experience in relevant 
Problem Situations. Í 
i 48 our contention that course examinations should not serve 
~~ “itect tests of intelligence. (If measurement of intelligence, is 


st s, 
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desired, quantities of standardized tests for the purpose are avail- 
able.) Neither ought they be measures of the student’s person- 
ality, nor should successful performance in the testing situation 
be contaminated any more than necessary by personality vari- 
ables, 

ry @Ve believe that when the right-minus-wrong instructions are 
added to an ordinary examination a new set of decisions faces the 
individual student. For example, such questions must necessarily 
arise as: “How certain am I of this answer? What is the probability 
of my being right? If the probability is 80 to 20 that I am right, 
then am I better off to ‘guess’ or to omit the item?” Ability to 
‘figure the odds’, subjective levels of confidence in own judgment, 
and willingness to gamble or to take a chance, are among the 
variables which might well be expected to influence the right- 
minus-wrong measure of achievement. Such influence can only 
make the test less valid as a measure of achievement. 

Tt seems obvious that such a contaminated measure is not 
only less useful for its avowed purpose, but also that it will work 
hardships on individual students because of individual differences 
in shrewdness in test-taking or in other personal factors. 

The present study represents an attempt to find evidence 8 
to the presence of contaminating factors in the conventional 
right-minus-wrong test score. 


PROCEDURE 


Two approaches to the question under consideration suggest 
themselves. One approach involves scoring the performance of 
each student on one test in two ways—by applying a penalty for 
guessing, and by a simple count of the number of items answer 
correctly. The two resulting sets of scores can then be compar 
in their relation to each other and to other pertinent variables. 
A second approach would involve having some course examinations 
of the right-minus-wrong type and others of the ordinary ‘2 


*We agree with Cronbach (1) that “When the test is used to fill other 
purposes, the score becomes invalid for the primary purpose of measure 
ment; essentially the more elements that enter a score, the less meaningt 
it becomes.” A few writers have considered the dangers of contaminating 
measures of Course achievement with the effects of personality difference 
(8, 10, 18) but only Votaw (11) has approached this question experimentally: 


j geene data on the relationship of measures of dominance-submissi” 
x guessing behavior. 
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penalty’ variety; then comparing the student's scores on the two 
types of tests. 
‘There are advantages and disadvantages inherent in both de- 
signs. The first approach was followed in this investigation. 
Subjects—The subjects were students in the second semester 
course in psychology designed for non-psychology majers at Be 
University of California. Of the four hundred and fifty students 
who were enrolled in this course three hundred and ninety are active 
subjects in this investigation. The criteria for inclusion were 
‘that the students have participated in four midterms and the final 
examination and that the students have satisfactorily completed 
‘the Minnesota Multiphasic Psychological Inventory. 
Haperimental Examination—The first midterm examination 
‘consisting of fifty true-false items served as the experimental 
“examination for this study. The class was divided between two 
examination rooms, students whose last names fell in the first 
ag of the alphabet going to one room, the remainder to 
other. 
an In one room ninety-eight subjects took the examination on a 
‘no penalty for guessing’ basis. “Answer all questions, please.” 
This group served as a control for the test-taking condition. 
h the other room two hundred and ninety-two subjects were 
told that the examination was to be scored on a right-minus-wrong 
basis with no score for omitted items. After these students had 
completed their tests they were instructed to circle all the items 
they had omitted. Following the completion of this task, they were 
asked now to indicate what they believed to be the most probable 
right answers for the previously omitted items. The circling tech- 
nique made it possible to obtain three scores for each students 
Performance on the examination. These were (1) a right-minus- 
‘Wrong score (R—W), (2) a score representing the total number of 
items that the student answered correctly when required to try 
: all (No. Right), and (3) a penalty score (Penalty Score) deter- 
mined for each student by the difference between his R—W score 
and his No. Right score. 

Criteria for course achievement—To aid in interpretation of 
results some definition of course achievement was desired. A best 
|| PPtoximation was utilized for the study, consisting "of the com- 
“mens ores on the three midterms following the experimental 
examination and the final examination. True-false questisag, 
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multiple-choice questions and essay questions each contributed 

about one-third to this measure. This measure of course achieve- 

ment turned out to be stable, to yield a wide distribution of scores 

for our sample of subjects, and, in terms of subjective evaluations 

of staff, generally to rank students in accord with expectations 
upen office hour contacts. 

Personality dimension.—It was our hypothesis that students who 
lacked self-confidence, especially confidence in their own judgment, 
who had difficulties in making decisions, and who were easily 
threatened by ambiguous situations, would be penalized by the 
R — W test condition. To test this hypothesis we desired a per- 
sonality measure which might be expected to relate to some of 
these variables. 

The measure determined upon was the A scale of the MMPI. 
This is a new scale of thirty-nine items developed by Welsh (12). 
It is claimed that people scoring high on this scale are characterized 
by introversion, rumination, anxiety, low self-esteem, and undue 
concern with the impression they make on others. The ‘flavor’ of 
the scale, its face-validity, is conveyed by such items as: 


» 


Criticism or scolding hurts me terribly (true) 
I worry quite a bit over possible misfortunes (true) 
\ I have often lost out on things because I couldn’t make up my mind 
soon enough (true) 


The group form of the MMPI was administered to the entire 
class in mid-semester, and for the purpose of this investigation 
was scored for the ‘A scale’? 


RESULTS 
Experimental Groups 
1) ‘R — W’ and ‘No Right’ as measures of achievement.—Com 
parisons were made of the two scoring methods applied to th? 
fifty-point experimental midterm. The Pearson Product Moment 
correlation between the ‘R — W’ and ‘No Rt? measures was .96- 


1 The investigators did not believe, of course, that the ‘A scale’ was the 
only measure worth following up, nor did they believe that they knew ^t 
the present stage of its validation exactly what aspects of personality it 
measured. However, available evidence about this scale suggested it to be 
a good bet. Insofar as the case against ‘R-W’ scoring goes, contami 
ajian by any personality factors, whether or not closely defined, wou! 


portant. 
» 
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n between the ‘R, — W’ scoring of the experimental 
the criterion measure of course achievement was .48; 
o Rt’ and the criterion the correlation was .49. 
Wi scoring method yielded a lower mean score and a 
able distribution of scores. However, this must be seen 
to the extremely high correlation between#R — Y 
Rt scoring, and essentially the same degree of relation- 
n these scoring methods and the criterion for course 
t. 
‘method of scoring and the personality measure.—The ‘A 
d a skewed distribution of scores with a mean of 9.93, 
deviation of 7.8, and a range from 0 to 35. 
ion coefficients were calculated in an effort to assess 
of relationship between the ‘A scale’ score and the two 
roaches to the experimental midterm. The correlation 
‘A? and ‘R — W’ was —.23; between ‘A’ and ‘No Rt’ 
—.22. A coefficient was also derived for the relationship 
the ‘A scale’ and the achievement criterion. This cor- 
‘was —.20. These correlations are all significantly different 
3 


above findings do not answer the question as to whether 
students were differentially handicapped by ‘R — W’ 
because of personality factors. A comparison of those 
high in maladjustment as indicated by the ʻA scale’ 
e low should, however, offer crucial evidence. It was 
study approximately fifty cases at each end of the 
. Because of the presence of a relationship between the 
y variable and course and examination performance, 
from the two ends of the distribution were paired on the 
their total course achievement. It was possible to obtain 
pairs, matched within two points on the achievement 
Thus we are comparing two groups of students who 
d equallysyell on the accepted measure of course mastery 
‘differ only in that they are the ‘highs’ and the ‘lows’ on 
ity variable in question. How these groups functioned 
€xperimental midterm is summarized in Table I. 


n of the scatter plots reveals the relationship between the 
all other measures to be curvilinear. Thus the reported cor- 
‘only be used to predict in one direction—from the ‘A seale’ 
mance measures. A 
a 
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Taste I.—COMPARISON OF PERFORMANCE ON EXPERIMENTAL MIDTERM or 
STUDENTS WITH ‘A SCALE’ SCORES or 15 AND OVER WITH THOSE WITH 
Scores oF 4 anD Less 


® | 


Highs Lows 
Asale, | (Ascale | Mean Dif. | Std. Error CR 
BP ò and greater)| and less) 
Mean R-W Score 18.87 22.85 3.98 1.23 3.23 
Mean No. Rt. Score | 34.89 36.32 1.43 .89 1.61 
Mean Penalty Score* | 16.02 13.47 2.55 -837 3.05 


* The penalty score represents the difference between the gross scores on 
R-W and on No. Rt. It is not a completely satisfactory statistic. However, 
the skewness of the R-W distribution made questionable the use of differ- 
ences between standard scores. 


These findings show that students with high ‘A’ scores were 
penalized by the use of ‘R — W’ scoring in comparison with stu- 
dents whose ‘A scale’ scores were low. The difference between these 
groups is statistically significant at better than the .01 level. 
Further, this is a difference of practical significance, amounting 
to almost a full letter grade within the ‘R — W’ distribution. 

3) Analysis of patterns of performance under the R — W con- 
dition—A study of the ‘omitting’ behavior of the subjects of the 
experimental group was made in an effort to throw additional 
light on the manner in which the higher ‘A scale’ student was 
penalized. 

One hundred and eighty-seven students omitted three or more 
items. These subjects were divided into two groups: those whose 
‘A scale’ scores were 15 or over and those with scores of 14 and les. 
Next, these subjects were paired, high ‘A’ with low ‘A’ on the basis 
of their ‘No Rt? scores. This was to match them in terms of thel! 
knowledge of the material covered by the examination. It was 
possible to obtain forty-four perfectly matched pairs. 

Two comparisons were then made. The first was in terms of 
the number of items omitted by the two groups. Students with 
higher ‘A scale’ scores omitted under the ‘R — W’ conditions 6.29 
items, on the average; those with low ‘A scale’ scores omitted | 
5.09. This difference results in a critical ratio of 2.03 and is 38 
nificant at the .05 level. 

A second comparison was made. The proportion of items later 

Prrered correctly (after circling) by the two groups was studie 


> 
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to see whether the higher ‘A’ subjects differed from the low ‘A’ 
subjects in this regard. Sixty-three per cent of the items omitted 
by the higher ‘A’ subjects were later answered correctly by them. 
Fifty per cent of omitted items were later answered correctly by 
the low ‘A’ subjects. The significance of this difference of thirteen 
per cent was assessed in three ways: In terms of the % resulting 
from analysis of correlated proportions, in terms of the difference 
between paired percentagesutilizing the ~/arc-sine transformation, 
and according to the non-parametric ‘Sign Test’ suggested by 
Dixon and Mood (2). In all three cases this difference proved 
significant at the .01 level or better. 

‘These findings reveal that the excess penalty which we have 
found to accrue to more maladjusted students comes in part, at 
Teast, from two factors. The more maladjusted students omit more 
items, and they omit a larger proportion of items that they are 
Capable of answering. 


Control Group 


‘Téwill be recalled that ninety-six of our subjects took the identical 
“midterm as did the experimental group, but under different con- 
ditions, The controls never believed the examination to be of the 
Penalty-for-guessing type, taking it only on the ‘try all the ques- 
tions’ basis. 

The control group data provides information making possible 

rper interpretation of certain of the experimental group results. 

1) Did the experimental group subjects try less hard on the 
Previously omitted (circled) items when instructed to answer 

mem than they would have done had the examination been a 
conventional ‘number right’ type in the first place? 

2) Did the very situation of being confronted with a ‘R — W’ 
test modify the performances of students in general? 
————— 


“Additional information was obtained on this matter by taking another 
sik oa data. Sixty-six pairs of students were studied. These were 
hed for ‘No. Rt’, but divided into two groups: those who answered 
Correctly better than fifty per cent of those they omitted, and those who 
rered Correctly only fifty per cent or less. A comparison of the ‘A scale’ 

ea for those two groups revealed a mean ‘A scale’ score for those students 

t Successful on omitted items of 11.73, and a mean score for those less 
ot Of 8.42. The mean difference of 3.31 points has a standard error 


M, iS Significant at the .01 level. 


ther Points. The Critical Ratio for this difference is 2.65. The cna: 
a 


y 
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TABLE 2.—CoMPARISON OF CONTROL AND EXPERIMENTAL GROUPS on Basic 
VARIABLES 


Achievement Criterion Erpe taonta] Exa A scale Scores 


2 ə Mean | Std. dev. | Mean | Std | Mean | Std. 


Control subjects (n = | 242.97 | 20.61 35.50 | 4.40 | 10.07 | 8.51 
96) 


Experimental subjects) 241.87 | 19.59 35.87 | 4.21 9.95 | 7.80 
(n = 292) 


- 3) In the case of those students who had been found to be 
handicapped by the penalty-for-guessing scoring, was part of the 
explanation of this handicap to be found in the general atmosphere 
of the test situation itself? 

Should the control group turn in a better performance on the 
examination than did the experimental group, the answer to any 
or all of the above questions could be ‘yes’. It would be difficult 
with our data to determine the relative contributions of careless- 
ness and of insecurity. However, should there prove to be no dif- 
ference, then the indications would be negative on both counts. 

A comparison of the control and experimental groups follows. 

None of these differences between the Experimental and Control 
groups was significant at better than the .30 level. 

In the case of the control group a correlation of .56 was found 
between the experimental midterm and the criterion for achieve 
ment. When this is compared with the .49 in the case of the 
experimental group, the difference is not significant." 

Two final checks were made. Those subjects of the control 
group who had ‘A scale’ scores of 15 and over were compared i 
their performance with those who had scores of 14 and less. Ad- 


* The correlation between the ‘A scale’ and tHe control group score 
on the experimental midterm was —.003. Between the ‘A scale’ and the 
criterion for achievement it was —.070. 

The significance of the difference between these ‘zero’ control correlt 
tions and the comparable experimental ones (of the order of —.20) we 
calculated utilizing Fisher’s Transformation. These differences do reach 
Peo of significance. We are unable to find an explanation for the#? 

ces. 
D 
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| subjects were matched for ‘A scale’ score with 
the experimental group. All other variables were dis- 
the pairing. No significant difference resulted 
parison. High ‘A’ subjects in the control group were 
from low ‘A’ subjects in regard to ‘No Rt’ oe 
ntal midterm, or in regard to course achievemént. Hi 
jects in the control group were not different from high ‘A’ 
in the experimental group, nor were low ‘A’ controls 


from low ‘A’ experimental subjects. 
the control group, then, leads us to the conclusion — œ 
is no evidence of carelessness, or of disturbance in 


performance simply from being in the ‘R — W’ testing 


CONCLUSIONS 
the basis of this investigation we find no reason to reject 


t original hypothesis. We find that students who score on the 
ad justed end of the ‘A scale’ are handicapped by being scored 
— W’ basis, This holds true.even when the students’ 
of course material is held constant. The evidence sug- 
‘that these students are penalized by their tendency to omit 
ms, and to omit items the answers to which they know. 
can see no justification for teachers continuing to use cor- 
j for guessing. The assets accruing to such a system have 
been equivocal, and the liabilities, we believe, are now 
hope that Traxler (9) is unduly pessimistic when he says 
he doubts that school people will ever be willing to encourage 
to guess at every item in a test. He believes that these 
rs would feel this to be condoning carelessness and loose 
We hope that the educator will also take into account 
of fairness, and of respect for the ‘whole child’. 
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TOPOGRAPHICAL ORIENTATION IN 
NAVAL AVIATION CADETS? 


BRANT CLARK anp R. DANIEL MALONE 


United States Naval School of Aviation Medicine, Pensacolag Florid® e - 


‘The problem of how an individual orients himself to geographical 
points is an old one and has involved much theoretical controversy. 
_ Nevertheless, very little clear understanding of the specific proc- 
esses involved has resulted, although radical postulations of e 
‘special senses’ in both animals and man have been common. The 
“most intensive studies have been made by educators who have 
heen interested in geography (4, 15, 16, 18, 19, 23, 27). The majority 
‘of these studies have investigated childrens’ ability to point 
north or to other compass points, while some of them have been 
i ‘concerned with the childrens’ ability to indicate the direction of 
places within their own community as well as distant points. Only 
a half dozen studies have been found which were concerned with 
i _ topographical orientation in human adults. Two of these papers 
N were essentially anecdotal in nature and merely describe the 
Activities of a limited number of individuals having either a high 
degree of spatial orientation or experiences of being seriously 
disoriented (2, 3). Only two investigations (14, $1) have employed 
large number of subjects. These studies have demonstrated wide 
individual differences in orientation to geographical points as well 
as constant group errors. It has been demonstrated that scores on 
tests of orientation to geographical points show low correlations 
with general intelligence, age, knowledge of the vicinity, sex and 
school experience. Vision and mental maps appear to play a pre- 
dominate rôle in aiding the individual in orientation. 
eae importance of adequate orientation is well-known to pilots 
aircraft (7, 8, 22, 26, 28). This has been demonstrated by 
eo flight safety records, (7, 26, 28) analyses of the causes 
a in flight training, (17) and the relationship between 
Tea and visual orientation to success in flight training (17, 20). 
ag reported by pilots (28) show that in spite of their very 


i 
19 ee 
“te not ions or conclusions in this report are those of the authors. They 
of the to be construed as necessarily reflecting the view or the endorsement 
Navy Department. 
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with regard to their geographical position. It is significant to note 
that when this disorientation occurs, it is associated with marked 
nervousness and uncertainty on the part of the individual (2, 28), 

Tt was the purpose of this study: 1) to determine how well Naval 
@riationocadets are oriented to geographical points; 2) to identify 
common errors in orientation; and 3) to compare topographical 
orientation with two other tests of spatial orientation, tests of 
other mental abilities, and certain selected personal data. 


PROCEDURE 


The primary, methodological problem was to develop a tech- 
nique which would determine how accurately Naval Aviation 
cadets are oriented to certain geographical points, since no stand- 
ardized tests were available. A modification of the pointing proce- 
dure previously devised by High (14) was used because it was 
simple, involved a minimum of extraneous factors, was readily 
adaptable to group testing, and the results could be treated 
quantitatively. ra 

The test, which was finally developed, consisted of two parts 
containing twenty items each. Each part in turn consisted of 4 
list of ten well-known cities in the United States and ten well-known 
foreign cities (Tables I and II). In Part I, the subjects were asked 
to imagine that they were in Chicago, Illinois. Using a compass 
Tose as a reference, they were instructed to indicate the direction 
they would travel from Chicago to each of the twenty cities by the 
shortest possible route. The directions were: ‘Write in Column 
below the direction you would go from Chicago, Illinois, to reach 
each city by the shortest possible route. Refer to the compass 
rose for the directions.” They read the heading from the compass 
Tose in degrees and recorded it in the appropriate column on thè 
test sheet. The directions for Part IT were identical with those for 
Part I, except that the point of origin was Pensacola, Florid® 
These cities were selected on the basis of High’s study and # 
preliminary study at Pensacola, eight of them being common t° 
both points of origin. For each part, the cities were selected so that 
they were well distributed around the compass rose. All of the 
subjects were asked to face north during the test because the 
direction one faces during such a test has been found to be ° 

igportance in earlier studies (14, 16). 
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cities and to give a total orientation score. The 
lure used to obtain the total score on this test was 
unconventional. First, the latitude and longitude of each 
s were obtained from Goode’s atlas (6) for computa- 


e Ce culated using a method devised by Ageton (1). This head- 


is the direction a person would take to go from one point on 
earth’s surface to another by the ‘shortest possible route’. 


o points on a sphere. Third, the mean and standard 
n of the cadets’ estimates for each of the forty cities were 
Fourth, keys were made out for each item on the test 
rms of one-half sigma deviations from the initial great circle 
g which was the correct response. Deviations from this 
g within plus or minus one-half sigma were scored zero, 
each succeeding one-half sigma step was scored as a unit, the 
ion of the deviation being disregarded. The sum of these 
ions for all of the forty items was then used as the in- 
’s score on the test. This score was an “error score” inas- 
ch as it was the sum of the individual’s deviations from the 
ial great circle heading for each city. 
Mercator maps, which plot the earth as a flat rather than a 
t surface, are commonly used in schools and elsewhere for 
ence. Such projections of the earth involve gross errors, 
pecially near the poles. However, they are widely employed so 
ees using Mercator plots of the cities were also determined 
_ for comparative purposes (Tables I and II). For some cities the 
g ings are only slightly different, but for others the errors 
ate considerable as in the case of Moscow and Rome (Table T). 
er to compare topographical orientation with other 
ie em traits, additional test scores were obtained from the 
| Students? training jackets. Two additional tests of spatial ability 
j iven tothe cadets in addition to the tests of topographical 


a eieo week. The tests were: 

Essentials of Mathematics for Flight Students.—This test was 
by the Central Examining Board of the Naval Air 
Command. It is a ninety-minute test containing=se 
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hundred and five items which varied in content from simple arith- 

metic computation to the reading of graphs and arithmetic reason- 

ing given to flight students during their indoctrination week. The 

test is essentially a power test, since the time limit is very liberal. 

The standard scores obtained routinely on this test and recorded 

W the treining records were used. No data on the reliability of the 
2 test are available. 

2) Aviation Classification Test.—This is a group test of general 
intelligence used in the original selection of the Naval Aviation 
cadets, and was designed originally to predict success in ground 

D school training. It consists of one hundred and twelve items meas- 
uring vocabulary, comparison, practical judgment, and arithmetic, 
The testing time is forty-five minutes. The reliability of the test 
has been found to be approximately .80. 

3) Mechanical Comprehension Test:—This group test is also 
used routinely in the original selection of Naval Aviation cadets. 
It is essentially a test of elementary physics and mechanical re- 
lationships consisting of seventy-six items with a time limit of 
forty-five minutes. It was designed originally as a predictor òf - 
success in flight training. The test-retest reliability has been found 
to be approximately .75. 

4) Test of Correctness and Effectiveness of Expression—This is 
Test I, College Form A, of the United States Armed Forces Insti- 
tute Tests. There are two parts: The first emphasizes spelling and 
the second contains material on grammar, style, best words and the 
like. The time for this test is two hours and the score is the number 
of items correct. No data are available on the reliability of this 
test. 

5) American Council on Education, Psychological Examination 

, for College Freshman, (1947).—Standard scores were obtained from 
the students’ records for Q (Non-language), L (Language) and 
total score. The standard scores used were based on a group of mid- 
shipmen having completed two years of college work. 

6) Spatial Orientation —This test which is No. 5 in the Guilford- 
Zimmerman Aptitude Survey (13), was designed primarily t0 
measure the ability to appreciate spatial relations of things with d 
reference to the human body. The test contains sixty-five multiple- 
choice items in which the subject is asked to make certain jude 
ments of the position of the prow of a boat with respect to ®® | 


ip? 
y 


d other objects in the visual field. The working time on 
minutes, and the reliability given in the manual is 0.88. 
al Visualization.—This is No. 6 of the Guilford-Zimmer- 
ude Sarvey, and is designed to measure processes of 
movements, transformations, or other changes in 
The subject is asked to make judgments of ‘& 


ing personal data were obtained during the regular 

iod: 1) age, 2) home city and state, 3) flight experience, 
level, 5) travel in the United States (using a five- 

eck list), 6) travel outside of the continental limits of the 
States (using a five-point check list), 7) geography courses 
thin the last three years. In addition to this the cadets 

to give an estimate of their own orientation using & 
scale as follows: 8) orientation to the major points of the 
9) orientation during travel, 10) attention to the direc- 
aj avel in a strange area, 11) use of ‘mental maps’, 12) vis- 

n of the earth (Table IV). 
f the tests with the exception of the ACT and MCT were 
luring the first few days at Pre-Flight School. These tests 
art of an extensive battery given for experimental use or for 
Training Command. The data obtained from these tests 
and punched into IBM cards for computational pur- 
' number of individuals used for computational purposes 
S somewhat throughout the report since data were not avail- 
cadets on all tests, however, the specific N’s are indi- 
h case. 
jects were two hundred and forty-two Naval Aviation 
majority of whom were entering the Navy from 
life; the remainder were entering from the enlisted ranks 
vy and Marine Corps. They had passed the Naval Avia- 
ds on the Aviation Classification Test (above), & 
lical Ability Test (above), and a Biographical Inventory. 
five per cent of the group had completed at least two years 
5 The age range was from nineteen to twenty-seven years 
of 21.57 and a sigma of 1.47 years. 
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RESULTS 
(A) Orientation to Specific Geographical Points 


The mean and median estimated headings for, the several cities 
are listed in Tables I and II. The distributions of the cadets’ 
g 25 of the twenty cities for each point of origin were typically 

wed to a moderate degree. In every case the range of estimated 
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extensive as has been found with other groups 
} rs made by individuals approached 180° (i.e., the 
possible error) in several cases, e.g., Chicago to Cape 
e accurate means and particularly standard deviations 
ed by eliminating the atypical cases which it was 
the result of errors in recording or reading» This Ws 
king an initial distribution for each city, determining 
deviation and then eliminating all of those judg- 
ich deviated more than three standard deviations above 
the mean. Therefore, the data listed in Tables I and IT 
ly those persons who deviated three standard deviations 
from the mean. The number of eliminations in each case 
termined by subtracting the N in the table from two hun- 
two, which was the total number of cadets tested. The 
eliminations varied from zero (Chicago to Dallas and 
y) to eleven (Pensacola to Cairo). The major effect of this 
was to attenuate the standard deviation while the mean 
slightly affected. 
of cadets’ judgments with the calculated initial great 
ding —Deviations of the group from the initial great circle 
will give a measure of the orientation of the group to these 
h points considering the earth as a sphere. The data 
m in showing gross individual and group deviations from 
directions (Tables I and II). Significant differences 
3) between the great circle headings and the group mean 
occurred in thirty-seven out of the forty judgments. The 
varied from 1.60° (Pensacola) to 57.09° (Tokyo) with 
igin at Chicago and from 1.5° (Reykjavik) to 65.1° (Manila) 
origin at Pensacola. The errors were substantially less for 
n the continental limits of the United States. For these 
he mean of the judgments deviated from the true direction 
rent cities by amounts which varied from 1.60° (Chicago- 
cola) to 17.50° (Pensacola~Miami) with a mean of 7.51°, 
eas for cities located abroad, the deviations varied between 
(Pensacola-Reykjavik) to 65.14° (Pensacola~Manila) with 
of 27.04°. The difference between the mean of the errors 
United States and abroad were significant beyond the 
of confidence. The errors to points within the United 
less with the origin at Chicago than when the origin 
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a (P = .05). On the other hand, there was littleg¥f- ` 
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ference in the errors between the two points of origin for cities 
located abroad. These data show that there are constant tendencies 
within the cadet group to misjudge the true directions of cities 
within the United States and abroad. However, these group errors 
are much less than the errors made by individuals. Some illus- 

tions will be useful in showing the size of these errors: New York 
is almost due east of Chicago, but several people indicated that it 
is northeast, while several others indicated that it is southeast of 
Chicago. From Pensacola to Panama, the initial great circle heading 
is approximately southeast, while the estimated direction by nearly 
half of the cadets was southwest. Very large errors were also found 
from Chicago to Tokyo, Moscow, Rome, and Oslo, and from 
Pensacola to Cairo, Manila, and Tokyo. x 

Comparison of cadets’ judgments with headings determined from 
Mercator projection maps.—A major problem which has confronted 
map-makers for many years is how to make a satisfactory projec- 
tion of the spherical surface of the earth on a plane surface. All of 
the methods which have been devised to do this result in certain 
distortions of shape, scale, angle, and area. The Mercator projec- 
tion has been widely used in atlases in the past notwithstanding the 
fact that it produces gross distortions of size as the poles are ap- 
proached. Deviations of the cadets’ judgments from the direction 
of the cities as determined from Mercator projections were cal- 
culated as in the preceding section (Tables I and II ). The mean of 
the deviations of the cadets’ judgments from the Mercator heading 
were substantial. However, these deviations were less than the 
deviations from the initial great circle headings. Differences be- 
tween the mean of the cadets’ judgments and the Mercator 
heading between the cities exhibited critical ratios greater than 
three in thirty-two out of the forty judgments. The mean of the 
deviations for points within the United States was less for the 
Mereator projections than for the great circle headings, but this 
difference was not Statistically significant. The deviations from 
the Mercator headings for cities outside of the continental limits 
of the United States were significantly less than those for the great 
circle headings (P = .02). The deviations varied from 0.09 
(Tokyo) to 23.56° (Rio de Janeiro) with the origin at Chicago and 
from 0.13° (Cairo) to 27.92° (Panama) with the origin at Pensacola. 
For cities within the continental limits of the United States, the 


p Sppilest deviation for the group was 0.30° (Chicago-Atlanta), 
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t was 15.07° (Pensacola—New Orleans). The mean 
‘Mercator headings within the United States was 
than for cities abroad (P = .03). These data show 
hat gross deviations from the Mercator headings occur 
group and for individuals within the group. It is also 
the cadets are more closely oriented to the earth Sa 
than to the earth as a sphere. 
—These results demonstrate that many Naval Avia- 
are very poorly oriented to certain geographical points. 
show that there are large group errors as well as in- 
s. The results are in general agreement with other 
have found great variability from person to person, 
ntial’variation from place to place with regard to group 
81, 32). However, these results are directly opposed to 
(82) reports of bimodal distributions. His subjects 
‘to fall into two groups, one of which was well oriented 
other making approximately a chance score. No clearly 
bimodal distributions were found in the forty cities studied 
although the distributions were skewed. The explanation of 
erence is to be found in the fact that the cadet population 
and much more homogenous than the group studied by 
g. These cadets were all men of a limited age range, having 
ptitude test scores and, in general, extensive academic back- 
id. On the other hand, Woodring’s group was highly hetero- 
“Of the one hundred fifty-six subjects, one hundred and 
white and forty-nine were Negroes; one hundred and 
were males and thirty-nine were females. The age range 
seventeen-sixty. Intelligence quotient—by the Revised 
inet Scale—ranged from 44-122. The educational range 
n no formal education to college graduation.” (82) Since 
found several of these factors to be related to orienta- 
conceivable that selective factors may have contributed 
nodal distribution in his population. 
results of this study are in agreement with High’s (14) 
ig that foreign cities show greater errors than cities within the 
States and Woodring’s finding that distant cities show 
errors. It is interesting to note, however, that High (14) 
great variability for points within the city where his sub- 
ik the test. This may have been a function of the selection 
and the fact that some of his subjects were nota 
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oughly familiar with the city. However, High’s data suggest that 

a person may be poorly oriented in a limited area with which he is 

fairly familiar but well oriented in a larger area, Additional com- 

parisons with High’s data are possible since ten ‘of his cities were 

identical with the ones used in this study for the point of origin 
>» @ al? Chicago. The cadets were somewhat more accurate and less 
variable than the college group. However, the differences between 
the groups do not appear to be great in the light of the differences 
between the groups and the test situations. There was no significant 
difference (C.R. < 3) for four of the cities (New York, Panama, 
Cape Town, Moscow) while significant differences were found for 
six cities (Miami, New Orleans, London, Tokyo, San Diego, and 
Seatitle). 

The data show that the orientation of the cadets at the begin- 
ning of training is much closer to Mercator projections than to great 
circle or global projections. This is true in spite of the fact that the 
deviations are large in both cases. It would appear that the wide 
use of Mercator and other similar flat projection maps in school 
and elsewhere has had a marked effect on the cadets’ orientation 
to points on the earth producing marked disorientation, par- 
ticularly to foreign cities. Greater emphasis on the relation of 
geographical points to each other on a globe in schools and else- 
where would appear highly profitable to reduce or eliminate such 
gross errors. 

No attempt was made in this study to analyze the mechanism 
used to make the judgments of the direction of the cities. However, 
certain inferences may be drawn from the data at hand in terms of 
the type of errors made. Firstly, it is evident that the cadets as 4 
group have acquired certain erroneous associations in terms of 
distances and/or direction. For example, the cadets as a group 
believe that Panama is well west of Chicago and Pensacola when it 
is actually east of these cities. Secondly, the cadets’ orientation 18 
much closer to Mercator projections than to a global map. Indeed, 
they show little appreciation of great circle courses between points 
and apparently have little appreciation of the location of cities 
on the world as a sphere. An illustration of the very large errors 
which result from this type of disorientation is to be found in the 
group deviation of 65.14° from the great circle heading from Pensa 
cola to Manila. Although Manila is slightly south of Pensacola, the 

; ircle heading would be roughly north-northwest because the 
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well north of the equator. However, the mean of the 
ents for the shortest route was slightly south of due 
). The deviation of this direction from the Mercator 
only 3.14° which was not significant statistically. 
s, the cadets as a group are fairly well oriented to 
the point of view of Mercator projectionsp but ty 
ly disoriented from a global point of view. Similar 
tions from great circle headings with small deviations 
tor headings are found from Pensacola to Cairo and 
d from Chicago to London, Rome and Tokyo (Tables 
Tt is interesting to note that the deviations from the 
heading are greater from Chicago to Moscow. The 
to indicate a heading close to the North. This may be 
| as an artifact resulting from the instructions which point 
the Great Siberian Plain can be reached most directly by 
near the North Pole. These results give an explanation for 
that foreign cities show greater errors than cities within 
ed States. Mercator projections are sufficiently accurate 
the United States to produce only small errors, whereas 
projections are inadequate for cities abroad. Foreign 
e Toronto and Mexico City which are fairly well repre- 
y the Mercator projection show errors of similar amounts 
Mercator projection and great circle heading. 


Relationship of Topographical Orientation to Other 
Psychological Factors 


orrelations between the scores on the topographical 
test and other psychological factors are summarized in 
- Most of these correlations are not significantly different 

and all of them are extremely low, indicating that per- 
on the topographical orientation test is not closely related 

nce on the other tests given. It should be noted that the 
cal orientation score is an error score, since it was the 
the individual’s deviations from the initial great circle 
for each city. (The odd-even reliability of this test was: 
; Part II—.65; and Total Score .75, when corrected by 

n-Brown formula, the reliability of the total score 
ès 86). The test of topographical orientation showed the 
tionship to the Q score on the American Council on 


Psychological Examination and the test of i 
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Taste III.—Correlation between the Test of Topographical Orientation 
and Other Variables. (Note that the topographical orientation score 
is an error score.) 


Topographical Orientation 
N 
2 @ a PartI | Partir | Total 
1) Spatial Orientation (Guilford-Zim- «06 | —.01 | —.04 233 
merman, Part V) 
2) Spatial Visualization (Guilford- | —.13* | —.07 | —.11 233 
Zimmerman, Part VI) 
3) Aviation Classification Test —.06 | —.10 | —.09 224 
4) Mechanical Comprehension Test —.14* | —.02 | —.08 224 
5) American Council on Education—Q| —. 19ł | —.19f | —.21f | 218 
6) American Council on Education—L | —.04 | —.08 | —.07 218 
7) American Council on Education— | —.15* | —.17* —.18f | 233 
Total 
8) Correctness of Expression —.15* | —.14* | —.16* | 233 
9) Mathematics —.18f | —.21¢ | —.21f | 232 
10) Personal Estimate of Orientation | —.19t| —.09 | —.15* | 233 
11) Age 01 .06 04 | 231 
12) Educational Level 04 -08 .07 | 238 


* r —.13 significant at .05 level. 
+r —.18 significant at .01 level. 


matics. There is a less significant correlation between topographical 
orientation test scores and the cadets’ own evaluation of their orien- 
tation and the test of correctness and effectiveness of expression. 

No significant relationship was found between the test of topo- 
graphical orientation and the test of spatial visualization, which 
is contrary to the results found by High (14), who reported an T 
of .51 for his group. The reason for the discrepancy between High’s 
results and the results of this study may be found in the fact 
that (a) High’s group contained only seventy-five individuals, and 
(b) High’s group was much more heterogeneous, containing both 
men and women college students. Tt would appear, therefore, 
that in a cadet population, spatial visualization and spatial orien- 
tation are not related to topographical orientation, as measured by 
this test. 

The data for Naval Aviation cadets confirm Woodring’s findings 
that there is no relation between topographieal orientation and 2° 


} Oppbialligence; this is true notwithstanding the fact that Wood- 
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ulation was highly heterogeneous (31, 32). As has been 
t above, Woodring also found for his subjects (based on 
aphical points) that the distribution of average scores 
bimodal and the individuals could be classified into three 
r distinct groups. No evidence of bimodality was found eee 
dual cities as noted in the preceding section. Sinfilarly, the 
total score for the topographical orientation test gives no evidence 
of bimodality; however, the scores do fall on a skewed distribution. 
Tor the group of two hundred and thirty-three persons used in 
rt of the study, the mean was 78.04 with a standard devia- 
of 33.40 and a range from 40 to 390. However, only four of 
these scores fell between 187 and 390, while only six scores fell 
138 and 179. If these ten scores are eliminated, the mean 
slightly reduced to 73.19, while the standard deviation is greatly 
luced to 16.80. The remaining scores show only slight skewness 
dex skewness equals .30) toward poor orientation as compared 
with a moderate index of skewness (.50) for the total population 
). The distribution of scores on the topographical orientation 
can be described as being moderately skewed toward poor 
orientation, but thereisno indication of bimodality in these data. 
A composite personal orientation score was obtained for each 
Of the subjects (items 5-12, Table IV) from the personal data 
listed in the procedure. Each of these eight items was assigned a 
Weighted score depending upon which of the choices listed under 
| that question was checked. The scores for the individual items were 
then summed and this total was assigned as a measure of the per- 
Sonal orientation. To determine whether there was a relationship 
between the answers to each of the questions and scores on the 
topographical orientation test, the following procedure was fol- 
lowed for each question: 1) the responses were sorted into like 
groups, i.e., all subjects who marked question 5, ‘chiefly near home 
town’, were placed in one group; ‘chiefly in home state’ in a second 
group, eto. 2) The scores on the topographical orientation tests for 
each of these groups were obtained and an analysis of variance was 
eed to determine where the differences between group per- 
jomances were significant. 3) If a significant F was obtained, thus 
Indicating that several groups differed significantly among them- 
Yes, a t test was run to determine those differences between 
means which were significant. 

Questions 4, 5, 9, 11 and 12 provided F's significant at t=QQ1 
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Tasty IV. SUPPLEMENTARY DATA 
NANG s/o. POA ARRES A IAEN iU 1. Age (nearest year)...... 


2) Homeicitys a aE AA A n les A A 
3) Flight status: . .. aviation training, stage; 
PIOA NNAS nonpilot; approximate hours of flight 


p Dime. 

4) Encircle the highest educational level finished: High school—1, 2, 3,4 
years; College—l, 2, 3, 4 years; Post graduate. 

5) Travel in the United States: ......... chiefly near home town, 

VAUR chiefly in home state, ......... chiefly in home and nearby 

one or two transcontinental trips (or trips over 1,500 


D miles), ......... three or more transcontinental trips (or trips over 
1,500 miles). 
6) Travel outside the continental limits of the United States: 


E aa SOTAA NA one or two visits to Canada or Mexico, 
-. Several trips to Canada or Mexico, ......... one trip over- 
AES RRT two or more trips overseas. 
7) How many courses in geography or related subjects have you had in 
the last three years? ............. 
8) Do you keep the direction of the major compass points (N, 8, E, W) in 
mind when riding as a passenger in a car? 


Always Very often | Sometimes if Occasionally | Almost never 
butnotal-| I have a| when not 
ways reason familiar 
with the 
terrain 


9) Do you usually orient yourself in travel by means of some reference 
point, such as a city or some other place whose location is familiar to 
you; i.e., can you point to it if you wish? 


feet a 
Never Occasionally | Sometimes if| Quite fre- | Always out of 
if Ihavea | possibility | quently habit 
special of becom- 
reason ing lost 


10) If you are traveling in a strange area (city or country), do you try t9 
remember the number and direction of the turns you have made 8° 
that you will be able to find your way back the same way if necessary? 


pie Met | Co Ed ec 
Almost always! Frequently | Sometimes if Occasionally | Never 
I have a| if knowl 
special rea-| will have 
p> son to return 
the same 
o 
] 


way 


Taste IV.— Continued 


u na EPA 

mental map” of the area you are in, i.e., if you were 
lo go could you reproduce a rough approximation of the 
ich you find yourself as by drawing a crude map? 


I do this as | Very often | Invariably 
much as through 
most peo- habit 


ple 


lize a globe of the world with the continents and major 
land in their approximate positions with respect to each 


Believe I| My picture | Unsure of | I have a very 
know posi-| of globe is shape and poor pic- 
tions of about aver-| position of| ture of 
continents | age some conti-| globe in 
very well nents mind 


ubsequently the differences between the means of the 

for each question were examined to determine those 
8 which were significant. For question 4, the group with 
90] education was significantly better oriented (P = .01) 
group with one year of college. For question 5, the group 
d answered, ‘chiefly in home town’, had poorer orientation 
than the groups which had answered the remaining four 
i or question 9, no significant differences were obtained 

the means of the groups; this tends to indicate that al- 
Samples were not drawn from a common population 


which had answered, ‘invariably through habit’, showed 
ation (P = .05) than the groups which had answered, 

and ‘I do this as much as most people’. For question 
up Which answered, ‘My picture of the globe is about 
poorer orientation (P = .01) than the groups 
, ‘confident I have an accurate picture of globe’ 
know position of continents very well’. Also, the 
Answered, ‘unsure of shape and position of some conti- 
ed poorer orientation (P = .05) than the cadets who 
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‘confident I have an adequate picture of globe’. =q i 
for questions 8 through 12 were examined to dete 4 S 


t 
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mine how closely the answers approximated the distribution of 
individuals normally expected to fall within the various classes of 
ratings (12); assuming that the trait that is rated has a normal dis- 
tribution in a fairly large unselected group. None of the questions 
wt the criterion. It must be pointed out that this lack of continuity 
of the items of these questions no doubt had an important effect 
upon the variable results which were obtained. It is recognized 
that this weakness must be remedied in any future use of a personal 
orientation questionnaire. For this reason, a minimum amount of 
D emphasis is placed upon the results previously mentioned. How- 
ever, it is also noted that the results obtained point favorably to 
the feasibility and utility of such a revised personal biographical 
instrument to augment any future studies on topographical orien- 
tation. 

Discussion—The data show that, the factors measured by the 
topographical orientation test are unique. Very low correlations 
were found between these scores and those on two other orients 
tion tests, two intelligence tests and a mechanical aptitude test. 
A similar result was found for a variety of personal data, though 
some individual items of the personal questionnaire did contain 
items which were significantly related to topographical orientation. 
The fact that those who indicate travel, ‘chiefly near home town’, 
show poorer topographical orientation than those with more è% 
tensive travel experience might have been anticipated. However 
it is interesting to note that the other categories show no significant 
differences. The significantly better orientation of individual’ 
with no more than high-school education as compared with tho § 
with college training tends to support Woodring’s (32) finding that 
persons with lesser amounts of schooling do better on such test® 
The non-college cadets come from a group which has had a reli 
tively high amount of travel experience in the Navy which is W 
doubtedly an important factor. It is apparent that the current 
selection devices in the Navy do not include a variable related t0 
topographical orientation as measured by this test. ‘The relation 
this composite score to success in Naval Aviation Training is ™ 
Bomi However, a study currently underway will obtain date 0° 

is point. : 

It will be noted that all of the deviations measured by this te 

measured from initial great circle headings which would we 
p : Pre torrect initial heading treck the city by the shortest possi 
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calculations were made using the mean heading 
| by the group. This attenuated the error scores, but it did 

nge the intercorrelations significantly. Scatter plots of the 

ndicated that the elimination of the cadets having the 

ely deviant scores would not cause any significant chage 
elations. ` 


SUMMARY 


topographical orientation and seven other tests were 
0 two hundred and forty-two Naval Aviation cadets to 
types of errors in orientation they make, to identify 
which contribute to this type of disorientation, and to 
ne what relation there is between topographical orientation 
other psychological factors. In the test of topographical 
, large errors were found to be the rule for orientation 
dual geographical points. Deviations of the group from 
tation were as great as 65.1°, while some individual 
from the proper orientation approximated 180°, the 
possible error. Errors for cities located abroad were 
for cities within the United States. Deviations from 
headings were large, but these errors were much less 
deviations from great circle headings, particularly for 
ide of the continental limits of the United States. Cor- 
between total score on the test of topographical orientation 
et psychological tests were .21 or below; only three of them 
ficantly different from zero. Naval Aviation cadets are 
ted to geographical points, particularly as they ac- 

on the globe. The notion of great circle relationships 
tant cities does not appear to enter significantly into 
nts of the geographical points. Topographical orien- 
ears to be a unique characteristic which is independent 
“ors measured by the current selection battery for cadets 
Certain other factors such as mathematics, spatial orienta- 
Spatial visualization. 
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STUDENT DIFFERENCES IN COMPETENCY 
IN EDUCATIONAL PSYCHOLOGY PRIOR 
TO FORMAL TRAINING’ 


8 WILLIAM W. LYNCH, JR. 
Indiana University 


The research to be reported here is an attempt to analyze and 
study two types of competence that are commonly stressed in the 
training of prospective teachers in educational psychology: (1) 
familiarity with the principles, facts, and value judgments most 
typically presented in textbooks in educational psychology, and 
(2) ability to interpret information about an individual child’s 
behavior problems, as presented in written case-study form, and 
make valid recommendations for the solution of these problems. 
The need for such analysis can be supported by a review of the 
literature of the past several decades dealing with the content and 
teaching of educational psychology. In recent years, particularly, 
the discussions of ways and means of improving instruction in 
educational psychology have stressed the opinion that the psycho- 
logical competencies needed by teachers are not fulfilled by 
familiarity with principles, facts, and authoritative judgments, 
and that this familiarity and its applicative concomitants arè 
inextricably involved with the prospective teacher’s total per- 
sonality make-up. Just what the nature of these relationships may 
be is far from clear. It was the intent of the present study to probe y 
this complex of relationships by studying the two abilities men- 
tioned above as measured in a population of teachers colleg? 
students planning to become elementary teachers who had re- 
ceived no formal instruction in psychology. By selecting such a2 
untrained group it was possible to discern the extent of differences 
in these two abilities prior to training and to study the relationship 


of these differences to several psychological characteristics a 


background experiences of a primarily non-academic nature. a 
ci; 


related factors selected were: tested intelligence; personal and so! 
adjustment; occupational interests; attitudes toward child care aP 
discipline; and amount of past experience in caring for, teaching 
or directing children. 
pe in part upon the writer’s dissertation submitted for the PhD. 
n fát Yale University, 1950. aa 
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_ HYPOTHESES TO BE TESTED 


of a review of experimental studies and published 
pirion, and a logical analysis of the factors involved 
the following seven hypotheses were formulated for 


lents who have had no instruction in educational psy- s 
er significantly in their familiarity with the facts and 
this field that are contained in textbooks. Some of the 
‘principles are familiar to a majority of these students. 
dents who are most familiar with the content of educa- E 
chology are best able to interpret case-study data and to 
ilid recommendations for remedial measures. 
Among persons with no formal training in psychology there 
! relationship between tested intelligence and both (1) 
iliatity with facts and principles and (2) ability to interpret 
data correctly. 
‘ospective teachers with no formal. psychological training 


sured interests in social activities and in the care and 
of others are more proficient in both knowledge and 
tive ability than those who show little or no interest in 
tivities. 

Good social and personal adjustment is positively corre- 
h the informational and applicative criteria of proficiency 
ational psychology. 

ersons who manifest strict authoritarian attitudes 
child behavior are poorly informed in educational psy- 
d are relatively lacking in the ability to interpret case 


f SUBJECTS 
ndred and forty freshmen enrolled in three Connecticut 
hers colleges that prepare elementary teachers were 
test the hypotheses. These were divided into subgroups. 
in each subgroup and the hypotheses to be tested 


Lin Table I. All testing was done during the first half | 
ester of the 1949-1950 academic year. Š A 
© 
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TABLE 1. ARRANGEMENT OF GROUPS FOR TESTING 
College N Tests Hypotheses 
A 220 | Subtest on ‘Personality? I, II, II 
p 47 | Subtest on ‘Growth and Develop- 
© ment’ 


Subtest on ‘Individual Differences’ 
48 | Subtest on ‘Motivation’ 
48 | Subtest on ‘Learning’ 
57 | Subtest on ‘Problem-solving’ 
58 | Subtest on ‘Evaluation’ 
69 | The Case of Mickey Murphy 


B əf 131 | Abbreviated test of Information TEIL V, VA 
The Case of Mickey Murphy 
| The Minnesota Personality Scale 
A phe Kuder Preference Record 
|B. 9. 
C | 93° | Abbreviated Tesf of Information | II, III, IV, VII 
The Caseof Mickey Murphy 
Experience Rating Scale 
Scale of Attitudes Toward Child 
Behavior 


CRITERION TESTS 


1) Content Tests—A comprehensive battery of 432 objectiv® 
test items was constructed by the writer and an assistant. These 
items were based upon the content of fourteen textbooks in educa 
tional psychology published between December, 1945, and August, 
1949. The battery was divided into seven subtests covering the 
following topics most commonly treated in the textbooks: 7 

Individual differences, growth and development, personality, 
motivation, learning, problem-solving, and the evaluation of 
school achievement. The curricular validity of the battery W% 
checked by submitting the items to four educational psycholo 
whose training and teaching experience presumably qualified them 
to judge the validity of the items. The split-half reliability of the 
seven subtests, corrected by the Spearman-Brown formula, rang 
from .21 to .67, the median being .53. An abbreviated form of the 
entire battery was constructed by selecting items at random 
from the original subtests in proportion to the amount of spa? 


] gpa to the subtest topics in the fourteen source textbooks. THS 
o 


j 
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formula. 
Study Test —A published case-study test, The Case of 
y?, was selected as the criterion measure of ability 
a problem involving a young person in a school sitya- 
make valid recommendations for remedial pfoceduré’. © +- 


tested in this study was .64 when corrected by the ` 
Brown formula. 

UUN © 
DEPENDENT VARIABLE MEASURES ` 


higence.—Scores on the California Test of Mental Matur- 

anced '47 S-Form, were obtained for all subjects. This test 

administered as an admissions test prior to,this investi- 
pa ~ 


o with Children.—A check list*of activities such as 
Sunday-school teaching, etc., Was devised and accom- 
rating scale designed to indicate the extent of ex- 
the care and direction of children. These instruments 
upon a preliminary questionnaire study of the ex- 
of a group of education and psychology students. The 
this study were asked to check all of their experiences 
e care and direction of young people, and then were 
estimate the extent of this experience when compared 
full-time classroom teaching in terms of actual time 
ontact with children. Length-of-experience intervals 
‘one to three months’ to ‘two or more years’. 
sts —The Kuder Preference Record—Occupational was 
measure of occupational interests. The data from male 
subjects were treated separately. 
onal and Socidl Adjustment. —The Minnesota Personality 
Administered to the freshmen used in testing Hypothesis 
female data were treated separately. 
s loward Child Behavior—The writer constructed a 
m attitude scale based upon the written opinions of 
classes in psychology and child development on a 
problems such as discipline, freedom of choice 


nar The Case of Mickey Murphy, Lincoln: The ve ; 
tal a A 
~ 


j p 
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for children in everyday matters, the use of punishment and 
reward, academic standards, and the characteristics of a successful 
public-school teacher. The scale was designed to measure on an 
‘authoritarian-permissive’ continuum. The Likert technique for 
Pesce and scoring was used. No direct check on validity 
could be’made, but a preliminary scale was submitted to three 
experienced instructors in education and psychology who were 
asked to indicate the responses to each item that would reveal in 
their opinion attitudes at each end of the postulated continuum, 
Items on which the judges disagreed or which were judged to be 
either ambiguous or not related to the postulated continuum 
were discarded. The split-half reliability, corrected by Spearman- 
Brown formula, was .58. The test-retest reliability, with an inter- 
vening interval between testings of about four months, was .65 
for 34 subjects. 


PROCEDURE 


Hypothesis I.—The schedules of the two hundred and twenty 
freshmen subjects in college A precluded the possibility of adminis- 
tering the entire battery of content tests to each freshman. There- 
fore, the group was divided into five sections and three of these 
sections were administered three content tests each, and two set- 
tions were administered one content test and the case-study test 
(see Hypothesis III). All groups took one of the subtests in common 
so that a rough check could be made of the equivalence of the five 
freshman sections (Table I); 

The content test scores were treated both as raw scores and ae 
‘true per cent scores’ which were based upon theoretical ‘zero 
scores (scores theoretically obtainable by chance), The mean 
‘true per cent scores’ and the ranges of raw scores were compa 
for the seven subtests to determine whether a distinct profile of 
subtest scores existed among freshmen. The freshman scores were 
then compared with the distributions of scores obtained from 
seniors in the same college and the extent of overlap was deter- 
mined. Finally, an item analysis for all content tests was carried out 
in order to determine significant differences in responses of fresh- 
men and seniors. 

Hypothesis TI. —One hundred and thirty-one freshmen in college 
B and ninety-three freshmen in college C were tested with H 
lated’ content test which was derived from the tot# 


between À and the two criterion tests made it i doit 
order partial r’s between the two criterfon sco 

ut IQ. 

es III, V, VI, and VII.—These hypotheses were tested 

uting zero-order correlations between each of the two 

easures and the measures of intelligence, occupational 


mined by self ratings of length of experience.’ 
cond method of testing Hypothesis IV was devised by re- 
e self-ratings together with the check list data and 


RESULTS 


q 


Ypothesis I, positing the existence of differences in fa- 


pective teachers, appears to be substantiated by the data 
: N the first place, on both the separate topical area tests 
the total test the freshmen scored significantly above the 
cal ‘Zero’ score obtainable by chance. In the second place, 
teshman scores generally ranged from a low around the 
re to a point sufficiently high to result in a mean overlap 
Per cent of freshman scores above the median scores of 
ho had completed at least three courses in psychology 
child, and educational psychology) in addition to methods 
and practice-teaching. These two general findings indicate 
shmen do differ significantly in their knowledge of the facts 


Table IT. 


mall number of experience categories and a skewed distribution 
Correlation treatment. 


x 
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TABLE 2.—SUMMARY or RESULTS on CRITERION TEST oF EDUCATIONAL 
PSYCHOLOGY CONTENT 


LEW Median 
Group N Test citeal |Ñredian| Range | Senior 
Score Score’ 
gooo 
College A 47 | Growth and develop-| 24.1 | 33.6 | 17-46 | 38.7 
ment 
45 | Individual differ- | 24.5 | 31.2 | 25-41 | 38.2 
ences 
47 | Motivation 25.5 | 37.4 | 24-45 | 43.8 
48 | Learning 24.4 | 35.5 | 29-48 | 39.9 
57 | Problem-solving 23.0 | 34.6 | 22-43 | 40.5 
58 | Evaluation of school] 22.2 | 32.6 | 20-40 | 38.8 
Achievement 
All Sections 220 | Personality 22.7 | 33.2 | 20-45 | 41.9 
Colleges B and C | 224 | Abbreviated form 22.0 | 33.0 | 20-45 | 38.9 


* Scores obtained on 112 Seniors in College A, 52 Seniors in Colleges 
B and C. 


The freshman responses to the separate items in the content 
battery, when compared with the senior responses, indicate general 
familiarity with certain facts and principles in educational psy- 
chology, at least at a recognition level. Of the four hundred and 
thirty-two items in the total battery, two hundred and eight were 
answered correctly by a significant majority of freshman subjects. 
Two hundred and fifty items showed no significant difference 
between the freshman and senior percentage answering correctly. 
Of the one hundred and eighty-two items on which there were 
significant differences between the two groups, twelve were an- 
Swered correctly by a significantly larger proportion of freshmen 
than seniors, 

It was difficult to categorize the items which most freshmen 
answered correctly. Generally speaking, they consisted of generali- 
zations expressed in nontechnical language. In some cases fresh- 
man superiority over senior responses plainly indicated that gener 
information or ‘common knowledge’ that is valid psychological 
fact had been distorted by seniors through poor comprehension of 
psychological data during training.‘ The mean freshman score? 


‘For example, ninety-six per cent of freshmen recognized that ‘drill 
es learning,” while only seventy-six per cent of the seniors answel 
& 


nests, while showing some small differences that ap- 


ey 


e vontent test battery. 
caution is in order in interpreting the results per 
pothesis I. No attempt was made to discern evidenct-of 
ets which conceivably may have affected some of the 
8 results. Secondly, it is not redundant to point out 
I ior of these subjects being measured was most 
fa very low order recognition type of response and the 
liarity, used in the statement of the hypothesis, should 
eted accordingly. 
pothesis II, stating that there would be a positive rela- 
between fatuiliarity with textbook content and ability 
ret case-study data, was definitely confirmed. For College 
College C freshmen the uncorrected correlations between 
t test scores and the case-study test scores were .46 and 
tively, (both significant at the .01 level). When corrected 
ntuation these coefficients were .76 and .61. Partial 7's 
omputed, holding tested intelligence constant. The partial 
e 64 and .55, respectively. These results indicate that 
cally the relationship between the two variables is fairly 
somewhat higher, in fact, than those found in most ieee 
f ‘the relationship between knowledge and ability to apply." 
® positive relationship between tested intelligence and 
0 criterion measures of psychological proficiency posited in 
he IIT was borne out. College B and College © freshmen 
i orrelations of .37 (SE, = .08) and .43 (SE, = .06) for 
test scores, and .39 ROE = = .08) and .35 (SE: = :09) 


educational objectives. 

Troyer and Horrocks, John B. ‘‘Case-study tests of ability 
ledge of human growth and development,” Educational and 
easurement, 7: No. 1, 23-26, 1947. Ralph W. Tyler, “The 
n recall and foals Al processes,” Chapter 11 in Charles 
‘ation as Cultivation of the Higher Mental Processes. New 


n Company, 1936, pp. 6-17. E 2 A 
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4) Two methods of selecting high- and low-experience groups 
from the freshmen of College C and comparing these groups’ 
mean scores on both criterion tests yielded no differences that 
were statistically significant. > 

Tnterest test scores in ‘social service’ for male and female 
suivjects yielded correlations with the criterion tests that did not 
differ significantly from zero. 

6) Separate correlation of five subtest scores on the Minnesota 
Personality Scale with criterion test scores for male and female 
groups did not differ significantly from zero. 

7) Hypothesis VII, which assumes a relationship between 

attitudes toward child behavior on an ‘authoritarian-permissive’ 
continuum, seemed to be confirmed. The correlation of raw scores 
on the attitude scale with the criterion tests were: content test, 
.24 (SE, = .10) (significant at the .05 level but not at the 01 
level); case-study test, 82 (SE, = .10) (significant at the .01 
level). These indicate a tendency for those with ‘permissive’ at- 
titudes to score higher on both tests. When corrected for attenua- 
tion these correlations become .51 and .52, respectively. 
_ These results are not surprising when an examination is made 
of the separate items on both tests. Recognition of the ‘correct’ 
answers on the content test frequently requires the testee to 
make a value judgment rather than a decision of correctness of 
fact or theory. The sampling of the textbook content required for 
this test included judgments of textbook writers concerning desir- 
able educational practice. The same point may be made with 
regard to the case-study test. Here also many questions are Con- 
cerned with remedial practices in a school situation. A person 
generally in favor of more lenient or permissive handling of educa- 
tional problems would be apt to do better on either test. 


SUMMARY AND CONCLUSIONS 


Freshmen in three Connecticut state teachers colleges were 
administered tests of information based on the content of fourteen 
textbooks in educational psychology and a case-study test purpo't- 
ing to measure ability to interpret and make valid remedial r oo 
mendations concerning an educational problem of a psychologic® 
nature. Scores on these tests were examined to determine: 
the extent of differences in ability to recognize correctly fact 

jies, and authoritative opinion as contained in education® 
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books prior to any formal training in this subject; 


ity to interpret case-study data; and (3) the relationship 
two measures of psychological proficiency to measures of 
nce, experience in care and direction of young peole, 
in social service, social and personal adjustinent, and 


s and generalizations of educational psychology prior to 
and that there is a significant spread of differences in such 


ntly related both to the knowledge and to the applicative 
ects of educational psychology. Experience with children, 
vation interest in social service, and personal and social ad- 
tment as measured by the paper-and-pencil instruments used, 
{ low NO significant relationship with the two criteria of proficiency. 
ruments whose validity and reliability were suitable for a 
theoretical examination of the attributes concerned and 
the precise testing of exact hypotheses. In spite of this 
Nature of the study, several implications pertaining to 
onal psychology in teacher education are suggested: 

l e study’s confirmation of the existence of considerable 
geneity of psychological proficiency among prospective 
rior to formal training in educational psychology re- 
e general principle that instruction in this field, as in 
ds to be geared to the differences that exist. Further- 
only does this point to the need for caution in the inter- 
of objective tests used as measures of achievement in 
Onal psychology courses, but it also points to the need for 
hetrating analyses of the psychological proficiencies of 
‘training both before and after instruction in educa- 
Psychology. Until such study is made it would seem difficult 
standards of psychological proficiency for teachers that 
tie and valid. j 
finding of a positive relationship between the attitude 
the criterion tests suggests a host of Dro con- ` g 

top 
PS 


rôle of values in psychology instruction. The f. 
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close examination of both criterion tests revealed that the testee 
was required to make judgments not only of correctness of fact 
and general principles that account for human behaviors, but also 
of the correctness of certain procedures that depend upon value 
chajces as, well, suggests that educational psychologists need to 
attend more to the ways in which these value positions are learned, 
The question of when and how the ‘should’ questions are best 
handled in relation to the questions of ‘what’ and ‘how’ are in 
need of study. 

3) The apparent lack of relationship between the amount of 
experience with children and the two criterion measures reinforces 
the conclusion that first-hand experience per se is not likely to 
enhance one’s psychological understanding. Current emphasis 
in some teacher-education programs upon early field experiences 
may need to take into account more critically the types of guidance 
needed in such experiences if mature and valid understandings are 
to result. 


BOOK REVIEWS 


-Science and Human Behavior. New York: The 
Company, 1953, pp. 461. $4.00 


en Two, the former being a report of his investigations into 
logy of sh and we second being a novel which deals 


rist’, taking on the mantle which John B. Watson let 
two decades ago. Skinner says in brief that behavior is 
r datum concerning the mind with which science can deal 
e conceives of psychology strictly as the science of behavior. 


tionships that have a certain degree of generality and perma- 
He feels that the only relationships with regard to behavior 
lich | science can deal are those that connect behavior with 
‘physical environment. Therefore, to Skinner behavior is 
ndent variable and the environment is the independent 
d the task of scientific psychology is to find the func- 
tionships between these two sets, of variables. 
has no use for so-called intervening variables—con- 
ch other psychologists have found necessary to set up 
plain mental processes. A large part of this book is 


es which have been proposed both in the common-sense 

“everyday life and also by professional psychologists, and 
tered the phenomena which these intervening variables 
rms of situation voc response. Sometimes it is almost 


y sim difficulty by the common man and retranslate it 
1 terms of reinforcement, extinction, discrimination and 
m, What Skinner has done in the book is to set up a new 
hich will be consistent throughout in terms of a few 
ich he believes to be basic to a psychology which is 


this book i is more than a mere psychology, for i in i m 
am and a philosophy. Skinner says that ‘ 
121 


p. 


122 h Book Reviews 


more than a mere description of events. It is an attempt to dis- 
cover order, to show that events stand in lawful relation to other 
events.” (p.6) “Science not only describes, it predicts... nor is 
prediction the last word. To the extent that relevant conditions 
cayy be altered or otherwise controlled, the future can be con- 
trolled.” This then is the keynote of the book. Skinner is interested 
in building a science of behavior in order to be able to control 
behavior. 

After he finishes making his analysis of behavior, he turns to the 
problem of control. First, in Chapter 15, he describes self-control, 
and later, in Chapter 20, what he calls personal control and, in 
Chapter 21, group control. This is followed by a discussion of vari- 
ous controlling agencies in society, including government and law, 
religion, psychotherapy, economic control and education. 

The final section of the book discusses the problem of human con- 
trol in general, including such issues as, who shall control and for 
what purposes. Skinner belittles value judgments and believes 
that they have no place in the science of behavior. He then pro- 
ceeds to propose his own basic value and make it the fundamental 
guiding principle on which all decisions with regard to control shall 
be made. In doing so he draws a parallel between the process of 
natural selection in the biological world and the process of selection 
of behavior in society. The guiding principle in biological selection 
is the survival of the Species. In similar fashion he believes that 
the guiding principle in the selection of behavior is the survival of a 
culture. He says “a given culture is in short an experiment 10 
behavior” (p 430) and he believes that the final criterion of behavior 
is the survival of a culture, and the problem of estimating the sur- 
vival value of a culture is one for experimental determination. 
Skinner places greater value on the culture than on the individual, 
with all the social implications that such a point of view carries 
with it, He recognizes that this emphasis on the culture goes con- 
trary to the emphasis in western democracy on the individual, 
but he believes that his thinking leads him inevitably to this com 
clusion. 

Section II of the book is a summary of Skinner’s views about 
learning, and anyone who wishes a compact statement of Skinner s 
point of view will find it in this section. Skinner, of course, was 0D? 


of hie to propose a two-factor theory of learning, and eo 
#),0n is repeated in this book in terms of stimulus substitutio 
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very interesting n of intermittent reinforce- 
nd shows some of its practical implications. It will be re- 
d that some of the first experimental findings with regard 
ittent or partial reinforcement have come out of Skinner’s 
Skinner pays much attention to adversive behavior and 
behavior and there is a good discussion of punishment, 
on contributions of Estes who did his researches under 


ening variable, but when he comes to what he calls ‘pri- 
ie events in a natural science’ and the topic of the self, he oc- 
a y slips from the rigorous position which he has attempted 
Although he apparently is reporting Freud’s point of view 
284, he says “multiple selves or personalities are often 
systematically related to each other. Freud conceived of 
ni 060, Superego, and id as distinguishable agents within the or- 
sii... the superego and the id wereginevitably opposed to 
and Freud conceived of them as often in violent Conflict. 
l pealed to a third agent—the ego—which, besides attempting 
compromise between the id and the superego, also dealt 
E ' practical exigencies of the environment. We may quarrel 
ny analysis which appeals to a self or a personality as an 
miner or action, but the facts which have been repre- 
th such devices cannot: be ignored. The three selves or 
s in the Freudian scheme represent important charac- 
of behavior in a social milieu.” 
mtly Skinner has made no effort to translate these in- 
ariables—the id, the superego and the ego—into the 
behavioristic categories but says “the facts which have 
presented with such devices cannot be ignored.” 
Viewer read with interest Skinner’s discussion 3 the five 
agencies in society, but feels that he has unfajriy de- 
least two of them; namely, religion and cdi 


4 
fies 
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ner has described institutionalized religion and education as they 
now exist‘in current society, and has not given sufficient credit to 
religion and education which may be more of an inner, personal 
matter. In neither has he given sufficient credit to self-control, but 
hag, thought of both of these agencies as representing a fully ex- 

alized control by priests and teachers. As a matter of fact, 
because Skinner is so concerned with only relating behavior to 
external events he fails to give proper place to socalled inner 
or self-controls. In the discussion of self-control he rightly points 
out that all self-control is an outgrowth of external control. He 
fails, however, to give recognition to the fact that these external 
controls may have existed in the past and dropped out of the pres- 
ent picture so that for all intents and purposes these controls have 
been internalized or introjected. In omitting the recognition of 
these internalized controls as factors in religion and education he 
has given only a narrow and partial presentation of these control- 
ling agencies. 

Skinner seems to believe that control in education is very much 
a matter of punishment, and apparently does not recognize other 
types of control which are of a more positive nature. Evidence 
which this reviewer has gathered points conclusively to the fact 
that those teachers who depend most upon punishment as a means 
of securing control are among the least successful as teachers. This 
would point to the faet that there are other types of control in 
education to which Skinner has not given his attention. 

The publishers of this book in their advertising refer to this 88 
“an introductory psychology book.” Certainly Skinner himself 
can be under no illusions that this is an introductory text. As 4 
matter of fact, even the advanced student may find this book 
tough reading because it presupposes a rather thorough acquaint- 
ance with a vast amount of experimental material. This is a book 
without a statistical table and without the formal report of a single 
experiment. The unsophisticated reader may get the impression 
that Skinner has derived much of his material from the observation 
of and experimentation with pigeons because there are so many 
references to Skinner’s later work with pigeons. 

Any serious student of psychology, however, who wants to learn 


exaci S Skinner’s system is and what applications he would 


q 
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e practical world of affairs, will find this book a 


or reading and study. 
f PercıvaL M. SYMONDS 


oe 


ich Columbia University 


many anaia clinical interpreters have other 

to pick up by observation and imitation. The procedure 
oversimplified or stereotyped. The authors firmly 
very fact must be related to ever other, and they 
i Suggestions to the utmost frontier of Rorschach inter- 
m. No cue is too faint, no relationship too speculative, to 


e of the Book is that the authors are aware that 
tions may be invalid. Their first paragraph i is re- 


hach performance and other behavior. Unfortu- 
the relationships which are asserted have been cor- 
‘represent guessed-at laws. They are perhaps best 


proceeds by successive refinements, setting up ten- 
mces on Card I which constitute predictions about sub- 
tds; they can therefore be confirmed or modified (p. 261). 
i hypothesis that results after ten cards then requires 
lidation” from further data. It is in not stressing this 


ite their sophistication and their excellent section on 

and Smith do not succeed in maintaining their 
skeptical attitude throughout the book. 

t the writers do reëxamine current doctrine, they make 

. They often explain why interpretations are. pro- 


ng the Rorschach interpretation that writers on the , 


œ 


develop much improved criteria for RNE g 
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and form level. They describe the interpretation of ‘style’ or non- 
scored behavior most helpfully. They present new developmental 
norms derived from theses by Hemmendinger, Siegel, and Fried- 
man. The reviewer does not feel that their genetic approach to 
i retation is always helpful. To say that an adult has a score 
common &mong children is not explained by describing this as a 
“genetically early level,” or describing him as fixated at or regressed 
to this level. The significant point is that he has now acquired a 
positive way of behaving with his present organic potential, pre- 
sumably because this way of behaving is relatively rewarding. 
Wherever analysis in genetic terms appears, it seems to crowd out 
discussion of learning processes or dynamics generally. 
Recognizing the good features of the book, one still is startled 
by much of its content. “Shock on Card I is evidence for an unre- 
solved and intense relation with a mother figure . . . Shock on Card 
II is characteristic of ... uncontrolled destructive impulses.” Re- 
marks like “any time limits are characteristic of persons with obses- 
sive-compulsive character features,” “The concept ‘teeth’. . . ap- 
pears to be associated with masturbation and/or sibling rivalry a8 
a reaction to frustrated dependency needs.” Such comments are 
generally expanded in a way which gives them plausibility, yet 
the fact that the cables of evidence anchoring them to reality have 
all the tensile strength of moonbeams, and the authors’ introduc- 
tory warning that what they say may be either true or false, caus? 
one to question just how such writings can have a place in a science. 
Tf this book were read by a man from some ‘ivory tower’ who 
happened not to know that the Rorschach is currently taken ser! 
ously, it would sound to him like Ralph 1245C+ sounded to its first 
readers. Good science fiction is written with logic, any known facts, 
and imagination; at its best, its plausibilities foreshadow some 
things that science later establishes. Perhaps it is fairer to say that 
Rorschach interpretations are to the science of psychology a8 bis- 
torical novels are to history. The writer seeks internal consistency 
and plausibility, with due regard for established fact. Where there 
is a gap in fact, the scholarly historian acknowledges the gap, but 
the fiction writer fills it with what might reasonably have happened, 
i.e., with an artistic creation. P 
ent between scientist and clinician disappears ive 
clinical interpretation is an art at the present time. In the 
of testing it was a technology, wrong at times but nev 
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reative or intuitive. Now, instead, the clinician is 
all a portraitist who abhors unfilled areas on the canvas. 

because there is a practical demand for answers about 
‘or uncertainties. Society has rewarded this new species 
ye can scarcely criticize him for satisfying the dema=d. 

in the field want to be successful artists, und books Ñ < 

present one which tell how to complete an attractive and 
ng ‘portrait are wanted. When authors like Phillips and 
1 also help keep portraits closer to reality than they would 
that is so much gained. Once we grant that the clini- & 


elcome any book which contributes to making him a 
cientist. 
ntist cannot complain if the portrait is used as a guide 
inquiry about the patient or about personality in gen- 
h science fiction and historical fiction have suggested leads 
rious investigations of their fields. This emphatically demands 
i clinician’s suggested interpretations should send further 
| promising directions, not up ‘blind alleys’. There is good 
think that the Rorschach or other clinical interviewing 
j can do this often enough to be useful, even though, ac- 
the evidence, their suggestions on any case are not cor- 
igh to be trusted without further support. 
entist should object when he is told that he or the practi- 
io receives a Rorschach analysis should believe it. Among 
oks yet written on the Rorschach, the book under review 
losest to filling a practical demand for training clinicians to 
portraitists, without making unacceptable claims that 
its are dependable. LEE J. CRONBACH 


Garpyer. Fundamentals of Neurology. Philadelphia: W. 
ders Company, 1952, pp. 359. 


ae 3 
ology is, in this second edition of the textbook on the subject 
‘he first edition, presented as a seriés of basic concepts— 
din the light of growing knowledge in the field. The material 


Med for inclusion is based upon dissection and miae japie j \ 
the nervous system, experimental data, and th@yy 1 A i 
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neurological em In the nineteen chapters of this small 


volume are included a consideration of functional as well as struc. | 


tural topics. Also included is a glossary of the many new terms of 

Greek and Latin sources, and an index. Each thapter ends with 

a }rief biographical sketch of the people who have contributed to 
topies considered in the chapter. 

» Psychologists will be interested in this text on neurology as 


such and also in the treatment of the problems of learning and — 


language mechanisms in the last chapter called “Associative Func- 
tions of the Cerebral Cortex”. Treatment of learning and con- 


©» ditioned responses are presented in a very compact manner, and 


the frame of reference for. the consideration can be gathered from 
the psychological sources used by the author. These include Hil- 
gard and Marquis’ Conditioning and Learning, Lashley’s Brain 
Mechanisms and Intelligence, MeGeogh’s Psychology of Human 
Learning, Murchison’s Handbook of General Experimental Psy- 


chology, Pavlov’s Conditioned Reflexes, and Young’s Emotion in : 


Man and Animal. 
Psychological Service Center i H. MELTZER 
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re on trait variability within the individual is slight 
ith that on individual differences. However, with the 
st in profiles and patterns, there is likely to be more 
paid to the subject in the future. Formerly, the emphasis 
mt was on the level of fe ane, but the mounting 
0 profiles has broadened the focus to include, also, the 
f performance. This statement applies to the use made of 
up and individual test results for purposes of guidance, 
j and prediction. 
ide range of individual variability reported by Hull? was y 
ated by the later, investigations of Foran, Lillie and ws 
of Stout. And other studies, only incidentally con- 


mess. This variability or unevenness is described 
y by some type of differential score and is represented 
by a profile. Schlesser’s study® is the only one, to the 


of a dissertation prepared under the direction of Professor 
and submitted in partial fulfillment of the requirements for 
Of Doctor of Philosophy at Yale University, June, 1949. 

L. Hull, Aptitude Testing. Yonkers-on-Hudson N. Y.: World 

, 1928, p. 47. 
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Educational Research Bulletins, Washington, D.C.: The» 
Iniversity Press, 1928, p. 23. i 

» “Variations of normal children,” Journal of Experimental 
84-100, September, 1937. 

E Schlesser, “Development of special abilitie 
Journal of Educational Research, 40: 39-51, Sed: 
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writer’s knowledge, which has dealt with the reliability of this 
profile. In comparing the reliabilities of ability scores with the 
reliabilities of the differential or unevenness scores obtained from 
an achievement test battery, Schlesser found that even in the most 
Teliahly tested ability, slightly less than twenty per cent of the 

a trai:Uifferexces could be said to have significance. 

i Little work has been done to describe how stable the profile is 
from, one test performance to the next. There may be much ir- 
regularity in the effort put forth by individual pupils in taking a 
group intelligence test. Some may work hard on one subtest and 

& let up on the next; others may work consistently throughout the 
test battery. Such irregularity of effort would modify the shape of 
the profile essentially caused by trait differences. These possibili- 
ties raise two questions: 

1) To what extent is theshape of the profile related to the order 
of subtests? 

2) Is the shape of the profile related to the interval between 
subtests? i 

The relation between the extent of unevenness and educational 
achievement has been studied. DeVoss* found only slight and in- 
significant differences between the unevenness of abilities of gifted 
and unselected children. Gray’ found individual variability in edu- 
cational achievement somewhat greater in the dull group than in 
either the average or bright group, but agreed with DeVoss in 
finding no true difference between the average and the bright. 
Measuring achievement by college marks instead of a standardized 
test, Rausch’ found students with the greatest variability to have 
the poorest achievement in college. Pressey? found no evidence of - 
any real difference in the amount of unevenness between the dull, 
average and bright. Merrill,” on the other hand, concluded that 


“L. M. Terman, Editor, Genetic Studies of Genius. Stanford University, 
Stanford University Press, 1925, chapter 12. 5 ” 

7 Susan W. Gray, “The relation of individual variability to intelligence, 
The Journal of Educational Psychology, 35: 201-210, April, 1944. Ren! 

t Oscar P. Rausch, “The effects of individual variability on pe 
ment,” The Journal of Educational Psychology, 39: 469-478, December, 1 Hs, 

? S. L. Pressey, “Irregularity on a Binet examination as a measure of i 
reliability,” Psychological Clinic, 12: 236-240, May, 1919. ‘ail 
A aud A Merrill, On the Relation of Intelligence to Achievement in a 
e Pa wh y“ I Children. Comparative Psychology Monograp 
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eed with these conclusions after making a criti- 
We odrow’s data. From data of his own, Tilton 
inferences that resulted from his re-examination of 
; namely, an inverse relationship between uneven- 
later study Tilton“ found that unevenness was 
to progress in school. Pupils accelerated for 
'e more uneven than the pierade group, and 


and to raise these questions: Is dullness associated 
average unreliability in taking group intelligence 

eater unevenness of the accelerated group at school 
f unreliability? Another direct relationship which 
rted was between increased unevenness and ad- 
age. Might this direct relationship, also, be a func- 


wn, The Unevenness of the Abilities of Dull and of Bright 
York: Teachers College Contributions to Education, No. 


drow, ‘Mental unevenness and brightness,” Journal of 
ology, 19: 289-302, May, 1928. 

he relation between IQ and trait difference as measured 
en » Journal of Educational Psychology, 38: 343- 


W, Op. F p. 294. 
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e 


the dull due to their greater unreliability? (2) Is the increased w. 
evenness in advancing mental age due to unreliability? (3) And is 
the greater unevenness of the accelerated at 8chool due to unre 
liability? 

ð a 


2 Q PROCEDURE 


The experiment was conducted in the junior high schools of New 
Haven, Connecticut. The test used was the Terman-McNemar 
Test of Mental Ability, Form C, which is a battery of seven sub- 

SS tests. It was administered in all the schools in the usual manner 
A week later it was repeated in the same way to approximately 
one-third of the population, to be known in this study as the 
standardized group. After the same interval it was administered, 
by reversing the order of the subtests, to another third to be known 
as the separated group, by spacing each subtest on consecutive 
school days. The interval of one week was thought to be short 
enough to obviate marked gains due to learning, and long enough 
to minimize practice effects. The experiment was so designed in 
order to determine if the unreliability affecting profiles is changed 
by these two variants in standardized administration. 

Since Woodrow'* had found a marked increase in unevenness 
with advancing mental age, it was necessary in this study to hold 
mental age constant in order to keep this relationship under col: 
trol. The range of mental age levels selected for the sample wi 
dependent on those levels for which the test was adequate. When 
the distributions of the various mental age groups were tested e 
skewness, the 12:0-12:5 was the lowest age group and the 14:6- 
14:11, the highest found to be without skewness. Accordingly, the 
sample selected included the six half-year mental age groups from 
12:0-14:11, inclusive, which numbered 1138 cases. At each of thes? 
levels each subtest distributed the members of the group satis: 
factorily and without skewness, $ 

Inasmuch as it was necessary to express each subtest score 5 
comparable units for each of the six mental age groups, all or 
were converted into sigma scores. For this purpose, the total sam 
ple was divided both on the basis of sex and half-year mental a8° 
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differences in unreliability? If so (1) Is the greater unevenness of 
` 
it 
levels. The terms unevenness score, differential score and tr®' 
i 
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used interchangeably to mean the difference 
ividual’s average sigma score and his sigma score 
alar subtest. Hence the average unevenness score for 
ual was the mean deviation of that individual’s seven 


ability referred to in this study is the inconsistently of 
ce from one administration of the test to the next, 
‘orm. To measure the unreliability of each pupil’s 
score known and defined in this study as a dis- 
? score was adopted. D scores were derived in two 
7s: one set from raw scores, one from sigma scores. A 
‘was defined as the sum of the differences, after cor- 
made for practice effects, between the raw scores on 
second administration of the test. The D scores were 
sigma scores by the same method except that there 
to correct for practice effects. It may be noted 
ection or sign of the difference is of no importance in 
g the individual’s unreliability. The larger the individual’s 
greater is his unreliability in taking the test. 


a 


THE RELIABILITY OF THE PROFILE 


profile may be described with unevenness scores, its 
ity can be measured by them. It is of course recognized that 
profile is due to the unreliability of measurement. 
often ignored and the profile is misused because too 
ata is put in its precision. The question arises: how high a 
t how low a dip must there be in a profile before we can 
that it indicates an ability or disability? The reliabilities 
unevenness scores, which measure trait differences in the 
, are compared with the reliabilities of corresponding 
‘om which the profiles were made, as shown in Table I. 
column in this table gives the reliability of the abilities 
d by the seven subtests from the two administrations 
battery. An ability will be defined as the standard score 
me on test 1, 2, 3, etc. The average of each pupil’s 
id scores will be called the pupil’s general level of ability. 
ability, or disability will be defined as his standard score 
test minus his general level or average. Thus the peaks 


i the profile above and below the general J@yel i 
Tepresentations of special abilities and dř jes, 


Me 
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TABLE I.—Comparison OF THE RELIABILITIES OF TRAIT AND 
Trarr DIFFERENCE Scores ror THE STANDARDIZED GROUP 


ii © Standard Error of 
Subtest | Reliabilities of Abilities | Reliabllities of Traic Measurement of Trait 
> he 64 62 683 
2 32 +29 -860 
3 72 -50 -566 
4 54 43 782 
5 -78 61 657 
6 47 34 852 
7 66 51 +783 
av. z = .707126| av. z = .522644 
av. r = .605 av. r = .480 


spectively. And the term ‘trait difference’ heading the second eol- 
umn will be used to mean either a special ability or disability. 


(The reader is reminded of the interchangeable usage in this article 


of the terms trait difference score, differential score and uneven- 
ness ‘U’ score to measure these peaks and dips above and below 
the general level.) Here also the reliabilities from the two test 
administrations were calculated. è 
Schlesser!” reported a similar comparison of reliabilities of abil- 
ities and trait differences. Using the severe test of three standard 
errors of measurement from the mean before he admitted a reliable 
differential score, he found that, even in the most reliably tested 
area, only about twenty per cent of trait differences were signif- 
cant. Applying this same criterion for significance to the present 
data, only about five per cent of the trait differences in the most 
reliably tested area (subtest 5)'are significant. More general prat- 
tice is to regard a critical ratio of 1.96 as being sufficiently high t0 
indicate significance. An examination of the present data with this 
adjusted criterion, indicated that twenty-five per cent of the peaks 
and dips are significant in the most reliably tested area (subtest 5). 
In the least reliably tested area (subtest 2), ten per cent are signifi- 
cant. ; 
From these findings it is concluded that most of the irregularities 


of a profile, as measured by the test used, are due to errors in mee ' 


wes IT op. cit., p. 41. 
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erentiations are reliable only about twenty- 
in the most reliably tested area. 


ONS OF UNRELIABILITY OF PROFILES TO TEST 
ADMINISTRATION 

hich the shape of the profile was related Ñ the 
investigated by comparing the standardized 
versed group. The practice effects as indicated 
compared for these two groups and none of the 
gnificant. Furthermore, the range and standard 
onding subtests for the two groups were prac- 


ne, Additional evidence of the similarity in shape of the 


two groups was found by comparing their average 
fndividual’s unreliability or change in taking the 
istration to the next) computed both on the 

res and sigma scores. In both cases the standard 
ence far exceeded the obtained difference between 
comparison made of the reliabilities of the trait 
on each subtest showed no significant difference 
of the two groups, nor did it show any consistent 
differences between the reliabilities of correspond- 
Surprisingly enough, subtest 4, the relative position 
not changed in either group, showed the greatest 


parison of these two groups was made of the relia- 
erage U, and Us scores as shown in Table II. 
s scores from the two administrations of each sub- 
lated. Then these r’s were converted into z values 
nd the average z’s converted back to r’s. It will be 
difference on the whole between the two groups 
ant, nor was there any consistency in the direction 
ces between coefficients of corresponding subtests, 
ows. The standardized and reversed groups were 
ilar by all the criteria applied: average gain, meas- 
of central tendency and spread; discrepancy 
test to retest; reliability of both subtest scores and 
The reliabilities of the subtests showed a criss- 
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TABLE II.—Comparison or RELIABILITY COEFFICIENTS oF 
Unevenness Scores ror THE STANDARDIZED 
AND REVERSED GROUPS 


Subtest Standardized Group Reversed Group © SE of Difference 
1 -62 -53 
ei hd .29 44 
3 -50 .54 
4 43 49 
5 61 56 
6 34 -36 
7 51 -50 


. 58787 + 02598 


f= å 
.r = .490 C.R. = .566 


crossing in the direction of the differences, which was hard to at- 
count for, from the hypothesis set up. Also, the test showing the 
greatest amount of change was the one whose relative position was 
the same for both groups; the reliability of scores on the first and 
last subtests, on the average, does not seem to have been affected 
by the reversal. For these reasons it would seem that it was not 
the reversing which caused the changes, but other unaccounted 
for factors. Hence it may be concluded that the order of the sub- 
tests is not related to the shape of the profile. Whatever unrelit- 
bility there is in the profile is not due to the order of subtests. 
Does a battery of subtests all taken at the same time, icy at 
one sitting, yield the same profile that would be obtained if the 
several tests were taken at different times? A comparison of the 
results for the standardized group with those for the separ 
groups provides an answer. When the practice effects of these tw? 
groups were compared none of the differences were found to be 
reliable. In addition, the two groups were practically the same M 
regard to range and standard deviation. Further evidence of the 
similarity in the shape of the profiles of these two groups was 0b 
tained when a comparison of their average D (unreliability) sco 
was made. The standard errors of the difference between thes? 
scores, computed on the basis of both raw score and sigma scof 
ata, far exceeded the obtained difference. 
comparison of the reliabilities of subtest scores (trait mea 


Pa 
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t while the average coefficient of the separated 
ntly lower than that for the standardized group, 
of the differences was inconsistent. This, coupled 
t thatdnly two of the subtests were significantly lower 
ited group, led the writer to compare the separated with 
‘Accordingly, the reversed group was cgmbinefjwith 
d group since the two had been found not to be® e 
Compared with this enlarged group the difference 
ability for the separated group was not reliably lower, 
there was no consistent direction in the differences 
the subtest coefficients for the two groups. Only one sub- g® 
was significantly lower in the separated group than 
bined group. Finally, the reliabilities of the average U, 
ares of the separated and standardized groups showed 
) differences; nor was there any consistency here in the 
of the differences between the reliabilities for the cor- 
subtests, as examination of Table III will show. 
a e, the standardized and separated groups were similar 
to the measurements described above. Hence it is 
t the separation of the subtests (by the interval used) 
to the shape of the profile. The unreliability of the 
ot the result of the interval between subtests. This find- 
We practical implications for the user of tests: The ad- 
of many tests requires a longer time than one class 
tester need not feel hampered by the demands of such 


0 III.—COMPARISON or RELIABILITY COEFFICIENTS OF 
JNEVENNESS SCORES FOR THE STANDARDIZED 
AND SEPARATED GROUPS 


Standardized Group Separated Group SE of Difference 


.62 -50 
-38 
.51 


-36 
49 


av. z = .487912 
.480 av. r = .450 
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time restrictions but can separate the battery to fit the time 
schedule. 


z 
RELATION BETWEEN UNRELIABILITYSAND IQ 


A third purpose of this study was to investigate the cause of 
indigjHual differences found in unevenness. These data were first 
examined for the presence of the inverse relationship between 1Q 
and unevenness which was reported by Tilton.!® The findings were 
adequate to justify a search for any possible relation between un- 
reliability and IQ. 

& The reader is remined of an earlier statement regarding the use 
in this study of a discrepancy score (D score) to measure the un- 
reliability of each pupil’s performance. For the total sample the 
reliability of the discrepancy scores was .08 with a SE of .028. 
(When this r is stepped up by the Spearman-Brown formula it 
becomes .15.) While an r of this size indicates a very low reliability 
without practical significance, it more than passes the one per 
cent level of statistical significance.” There is therefore a slight 
tendency for some individuals to be more unreliable than others 
Are these more unreliable pupils the dull pupils? 

The D scores for the dull, average and bright groups were a8 

. shown in Table IV. A 

In determining the probability that the dull and average will 
always exceed the bright in unreliability, only the differences in 
one direction are involved: that is, (1) excess of unreliability of the 
dull over the bright, and (2) excess of unreliability of the average 
over the bright. These differences are reported in Table V. A differ 
ence between the dull and bright of this magnitude (2.47) and in 
this direction passes the one per cent test. In the same way the 
difference between the average and the bright satisfies the tw? 
per cent test, and the difference between the dull and average 
combined, and the bright more than satisfies the one per cent test. 
And the findings are very similar when the discrepancy scores Wel? 
computed from raw score data. Evidently the bright are more 1è- 
liable than the dull and average. It has been shown therefore that 
the dull and average are not only more uneven, but are also more 
unreliable; and that the bright are not only less uneven, but 


19 J, W. Tilton, loc. cit. ton: 
7 "E. F. Lindquist, Statistical Analysis in Educational Research, Bos 
HE Yin Co., 1940, p. 212. 


per 


liability of Group Test Profiles 139 


IQ 95-104 IQ above 104 
63.19 (421) 59.63 (345) 
20.91 20.80 
1,02 1.12 ® 


=C OMPARISON OF THE DIFFERENCES IN DISCREPANCY 
18 FOR DIFFERENT IQ GROUPS (SIGMA SCORE DATA) 


Difference Btandend Error of G E 
-35 1.51 23 
3.91 1.58 2.47 
3.56 1.51 2.36 
3.72 1.35 2.76 


hfe $ 

reliable, Hence, the negative relationship between uneven- 
hich has been reported could be a function of indi- 
as in test-retest unreliability. 


IN BETWEEN UNRELIABILITY AND MENTAL AGE 


she relation found between IQ and unreliability, 
s to the possible relation between unreliability 
sing raw score data the average discrepancy 
12.95 and 13.13 for the twelve, thirteen and four- 
vel, respectively. Although these scores tend to 
th advancing mental age, they are not consistent in this 
differences are not significant (critical ratios of 
| Tespectively). Hence these data provide no basis 
to unreliability the relation which Woodrow” found 
ased unevenness and advancing mental age. 
arisons were made using the D scores derived from 
e, however, IQ has been held constant lest the 
between IQ and unreliability, which has been 
‘should complicate the findings. For the average 
epancy scores are 65.01, 62.32 and 62.67 for 
n, and fourteen mental age levels, none 


py 


op. cit., p- 204. x biga 
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These D scores also decrease with advancing mental age and ina 
pattern similar to that of the other D scores. The slight deviation 
from the general trend of the fourteen mental age level is not sig- 
nificant. In this case, also, the differences are not significant; criti- 
cal ratios are 1.01 and .91, respectively. Accordingly, there is no 
9 basidi thess data for thinking that the relationship which Wood- 
“ row reported was due to individual differences. In fact, the differ- 
ences here are in a direction opposite to that which would be neces- 
sary to explain Woodrow’s findings on the basis of unreliability. 


3 


S RELATION BETWEEN UNRELIABILITY AND ACCELERATION IN SCHOOL 


Reference was made in the Introduction to a direct relationship 
found by Tilton® between unevenness and acceleration in school: 
the accelerated for their mental, age being more uneven than the 
at-grade group, and the retarded less uneven than the at-grade. 
Could it be that the greater unevenness of the accelerated is due, 
to a greater unreliability on their part? Since unevenness has been 
found to be a function of IQ, the relation between unevenness and 
acceleration in school should be studied by holding IQ and mental 
age constant. For this reason IQ was held constant between 95 and 
104. This average IQ group (95-104) of each of the three mental 
age levels of the sample was divided into the retarded, at-grade 
and accelerated on the basis of their grade placement in school. 
On the basis of normal progress through school, those in the twelve 
mental age group were considered at-grade if they were in the 
seventh grade and accelerated if in the eighth or ninth; those in 
the thirteen mental age group were considered retarded if they 
were in the seventh grade, at-grade if in the eighth and accelerated 
if in the ninth; and in the fourteen mental age group, retarded if 
they were in the seventh or eighth grades and at-grade if in the 
ninth. It is obvious from the foregoing that only the thirteen 
mental age group here contained all three classifications. Because 
of the incompleteness of the twelve and fourteen mental agè 
groups for this classification, with the resulting small number of 
cases in the retarded and accelerated groups; and because no Cue 
sistent pattern of unevenness was revealed for mental age levels; 
all three were thrown together for analysis. The mean unevenness 
score for the average group (95 to 104 IQ) was 80.38 for the 1°- 

(Prrded group (N = 60), 83.11 for the at-grade group (N = 287 


? W ? Tinon, unpublished manuscript, 1948. 
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he accelerated group (N = 74). While the differences 
they were in the same direction as reported by 
the relation between unevenness and IQ, these 
be èxpected to show as pronounced differences as 
ey include only a three-grade range, while his 
five-grade one. In view of the foregoing, thegdata 
suitable for an investigation of the relationship ® 
ability and acceleration in school. It will be re- 
the other step preliminary to this investigation— 
e unreliability scores are statistically reliable—has 
ted in a previous section. E 
e average IQ group (421 cases) was examined for un- 
ith these results: retarded 13.58; at grade 13.37; ac- 


ical significance. (Critical ratios of .71, .87, and .60) 
oted, too, that the trend of the differences in the reli- 
hese school groups is just opposite to that which one 
in order to explain the direct relation which Tilton re- 
ween unevenness and acceleration in school according to 
comparison of the unreliability of the three school 
‘made on the basis of the D scores computed from sigma 
vith these results: retarded 61.00; at grade 63.64; accel- 
. Here, also, the differences were not significant (the 
rs being in each case larger than the obtained differ- 
hese data there is, then, no evidence that the relation- 
unevenness and acceleration may be caused by 
Tn other words, the greater unevenness in the profiles 
ated is not the result of their greater unreliability. 


SUMMARY 


eem to justify the following conclusions, within the 
st used and subject to the conditions of the investi- 


is dependable evidence that much of the unevenness 

Sis due to test unreliability. In the most reliably tested 
ut twenty to twenty-five per cent of the time are 

ential scores or unevenness scores reliable; and in thé 


hse 
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least reliably tested areas only about five per cent of the time are 
they reliable. These findings have implications for the guidance 
worker who should be cautious about considering all high and low 
differential scores as evidence of a special ability or disability. 

2) The relation of the unreliability of profiles to test administra- 
tionjvas negligible. Profiles were not changed significantly by the 
two variations of administration investigated. These findings may 
have implications for the makers and users of tests: a test battery 
may be broken up without fear of its effect on profiles. 

8) Unreliability itself was found to have a very low but statisti- 
cally significant reliability. i 

4) The unreliability of profiles was found to be inversely related 
to IQ, in that the bright were found to be more reliable than the 
dull or average in taking group intelligence tests. From this finding 
it seems reasonable to conclude that the greater average uneven- — 
ness of the dull is caused in part at least by their greater unrelia- 
bility. j 

5) The data provide no basis for attributing the direct relation- — 
ship between average unevenness of profile and mental age, to 
unreliability. i 

6) And further, the data provide no basis for attributing the 
direct relationship between average unevenness of profile and at- 
celeration in school for mental age to unreliability. 
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TERED VERSUS INSTRUCTOR- 
TERED INSTRUCTION’ 


W. J. McKEACHIE 
' University of Michigan a 6 


have their stereotypes. And for most of us 
and ‘nstructor-centered’ are stereotypes. With 
we associate the halo terms of democratic, per- 
fective and student growth. ‘Instructor-centered’ 


tent-centered. In our psychological subculture 
title stack the deck against anyone who at- 
e instructor-centered point of view. 

onceptions, there are important areas of agree- 
e agrees that we're teaching students and that 
promote student learning. Also, almost everyone agrees 
improve our students’ problem-solving skills. This 
three things: 

“up our stereotypes by examining some of the di- 
may differentiate student-centered from instruc- 
ruction. 

-à group of research studies on the problem. 

e theorizing about the problem. 


DIMENSIONS OF DIFFERENCE 
he dimensions of difference? 


GOALS 


rominent is the dimension of goal setting. The 
ed teacher believes that he is ultimately respon- 


Jornell last year, “If the teacher merits the respon- 
his hands, he knows more than do the students 
, about the world in which we live, and of the 


presented in an APA symposium, “Student-centered 
mtered Instruction,” sponsored by Divisions 2 and 1 
ionds as chairman, Sept. 2, 1952. 


rms authoritarian, Fascistic, knowledge for its _ 


ng goals. To quote the report of a study group — 


143 eho, 
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ways in which a knowledge of psychology can enrich the world” 
(8). The student-centered instructor, on the other hand, believes 
that the group, including both students and instructor, should 
determine the group goals. 
Despite his greater emphasis upon the student in goal-setting, 
theltudens-centered instructor usually has certain implicit goals 
ov which he hopes will be achieved by the students. Thus another area 
of difference between instructor-centered and student-centered 
teaching is in the type of goals for which they are aiming. This area 
of goals needs to be plotted in at least two dimensional space be- 
SD cause there are three differing types of goals: intellectual, applied, 
and affective. 

One approach emphasizes the traditional intellectual goals of a 
liberal education. It attempts to create an interest in ‘Knowledge 
for its own sake’. The primary goal is to teach students to think. 
The instructor with this approach may be interested in attitudes, 
but they are the attitudes of the Scientist toward his subject mat- 
ter, not social attitudes. These are the goals which we usually as- 
sociate with the instructor-centered method, but which may be 
independent of other dimensions of the methods. 

The instructor who is interested in applied goals feels that psy- 
chology can offer much that is useful in the student’s daily life, 
not only in adult life but in college as well. He is interested in 
teaching the facts of psychology only insofar as they can be applied 
by the student. Specific skills such as reading skills or study habits 
are apt to be part of the content of his course. , i 

The instructor primarily interested in affective goals is likely 
to disavow any suggestion that his class is simply group therapy: 
Tn spite of this, he is even more dissatisfied than most teachers with 
achievement tests as a criterion of his teaching, and he is quite w- 
happy if his students don’t have some adjustment problems that 
their psychology course can cure. 

Student-centered instruction tends to emphasize the last of these 
three goal-emphases, but none of these goal-emphases are insolubly 
_linked with other dimensions of teaching. Benjamin Bloom, for 

example, has been much interested in the discussion method a8 ê 
technique for attaining intellectual goals. 


METHODS OF TEACHING 


Prokgbly, most of us think of student-centered instruction n 
f' fe from instructor-centered teaching primarily in terms 


-centered versus Instructor-centered 145 


n in the classroom. One of the dimensions here is degree 
nt participation. The student-centered instructor tends to 
se jgh degree of student verbal participation. Most stu- 
instructors, however, are not satisfied if student dis- 
directed at the instructor. They are interested in 
high degree of inter-student participation. Thdbtu- 
instructor feels that students who talk to him are ® 


| goal of independence. 
dimension of classroom behavior i is the degree of i in- 


limension of classroom climate is degree of group co- 
Typically the student-centered instructor attempts to 
up with a high degree of cohesiveness. He may attempt 
this by counting the number of ‘We’s’ as contrasted 
the number of ‘I’s’ verbalized in class discussion. ‘ 
r dimension upon which student-centered and instruc- 
lasses differ is in the degree to which the student feels 
influence his own fate. Perhaps it is unnecessary to elaborate 
point, but it may be pointed out that what have been 
dent-centered classes vary from classes in which the 
lays out a course outline, makes assignments, and ac- 
; des discussion, to classes in which almost all course plan- 
S done by group decisions and the instructor begins class 
it would you like to talk about today?” 
unt of class time devoted to personal experiences and 
the students may be another dimension. The instruc- 
concerned with the development of self-insight is more 
it or encourage discussion of personal problems. 
sly one could list many other dimensions upon which in- 
tered and student-centered teaching may differ. Those 
imply some which have been most frequently mentioned 
ture. When someone says he has used student-centered 
usually means that as compared with instructor-cen- 
a , his class has had a higher degree of one or more of 
es: 
nt participation in goal setting. ə 
asi8 upon affective goals. E 4 


È 
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8) Student participation and student interaction. 

4) Instructor acceptance of inaccurate statements. 

5) Group cohesiveness. 

6) Ability to determine its own fate. 

7), Amount of time devoted to discussing personal experiences 
and*problems, 


EXPERIMENTAL RESULTS 


One would expect that the controversy between our education's 
authoritarians and permissivists would long ago have been resolved 
by the cold logic of experimental studies. Unfortunately, this just 
hasn’t happened. The published experimental studies are not in 
agreement and there are a host of unpublished studies which re- 
main unpublished because the two methods used produced no 
significant differences in outcomes. 

By way of some examples: One of the best known comparisons of 


student-centered and instructor-centered instruction is that of Faw — 


(2). Faw’s class of one hundred and two students met two hours & 
weel to listen to lectures and two hours a week in discussion 
groups of thirty-four. One of the discussion groups was taught by 


a student-centered method, one by an instructor-centered method, 


and one group alternated between the two methods. 

As compared with the instructor-centered class the student- 
centered class was characterized by more student participation, n0 
instructor correction of inaccurate statements, lack of direction, 
and more discussion of ideas related to personal experiences. 

Surprisingly enough, Faw’s major measure of attainment of 
objectives was in the intellectual area. Scores on course examina 
tions showed small but significant differences favoring the student- 
centered method. In the area of his major interests—emotional 
growth—Faw’s method of evaluation was to ask students in the 
student-centered and alternating classes to write anonymous com 
ments about the class. Generally these comments seemed to indi- 
cate that the students felt that they received greater social an 
emotional valúe from the student-centered discussion groups 
they would have from an instructor-centered group. Despite the 
objective test results Faw’s students felt that they would have 
made greater intellectual gains in an instructor-centered class. 

Now compare Faw’s experiment with that of Asch (1). As’ 


like Ew, -staught all of the groups involved in his experiment. : 
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tered methods; one section of twenty-three stu- 
ht by a student-centered method, quite similar to 
ever, there were certain differences between Faw’s 
ents. In Faw’s experiment both student-cen- 
ctor-centered classes spent two hours)a weell lis- 
In Asch’s experiment, only the instructor-cen- 
subjected to lectures. While Faw doesn’t mention 
es that grades were determined by the instruc- 
Lof the course-wide examination. In Asch’s experi- 
in the student-centered class were allowed to de- @ 
grades. 
thing is that Asch’s results do not completely 
’s. On the final examination in the course students 
r-centered classes scored significantly higher than 
tl e student-centered class, not only on the objective 
test but also on an essay portion. Note, however, 
arte class was specifically told that the exami- 
in no way affect their grades in the course so that 
may be simply due to a difference in motivation. 
by the Bogardus Social Distance scale, attitude 


the instructor-centered classes a greater percent- 
f the-student-centered class improved in adjust- 
by the MMPI. 


n the instructor-centered section, but thought they 
ned more if they had been in an instructor-cen- 
is students rated the student-centered class higher 
ntered in helping them to learn the subject matter 
ut they actually scored lower. There seems to be 
the fact that advocates of student-centered methods 
ents’ perceptions of group achievement is erroneous; 
want students to take a larger share of the responsi- 
ion, and are pleased that they report great gains 
Social values in student-centered classes. If groups 
ter intellectual gains actually learn less, it might 
conclude that groups which report greater CG" 
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the areas of personal emotional growth may really be gaining less | 
than those groups which report lesser gains. 

One of the most comprehensive experiments in this area is that 
of Landsman (5). He experimented with a student-centered type 
of teaching as contrasted with a more directed type of democractic 
disdlssion erganized around a syllabus. His experimental design 
involved eight classes in a course sequence of Human Development, 
Adjustment, and Learning. Three instructors took part in the ex- 
periment and each instructor used both methods. Outcome meas- 
ures included the Horrock-Troyer tests, a local case history analysis 
test, Group Rorschach, MMPI, autobiographies, and students’ re- 
actions. His results showed no significant differences between meth- 
ods. 


A REDEFINITION OF THE PROBLEM 


What are we to conclude from these studies? While there is a 
dearth of follow-up data, with such slender results at the end of the 
courses our hope that either method produces significantly greater 
long-time benefits is probably unrealistic. Should everyone go his 
own way and teach any way he pleases? Personally, we are not 
willing to go quite so far, but certainly none of us should exclaim 
with horror, “His classes are instructor-centered!”” 

As psychologists, however, we believe in research. Why has re 
search on student-centered versus instructor-centered teaching 
seemed to lead up a blind alley? 

One reason suggested for contradictory results is that different 
people have meant different things by student-centered. But a far 
more important reason is that we’ve been lumping together more 
variables than we could handle with our experimental designs. 
The first part of this paper was concerned about defining dimen 
sions of difference because we need to work with a more limited 
number of variables and we need to relate these variables to the 
main body of psychological theory. a 

Do we have any assurance that in the welter of student-center 
versus instructor-centered research there are any variables t j 
make a difference? Despite a somewhat pessimistic view of E 
research in this area, there are enough glimmerings of hope 
justify further research. i: 

For example, Smith and Johnson (7) have found that studen d 
centergd tegching produces higher scores than instructor-centet® f 


é 
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n tests of reasoning ability and creativity. Furthermore, 
on the problem seems to agree that as compared with 

tered teaching, student-centered teaching results in 
t to the learning of facts (providing the classes have 
tests are based on the texts). 
Asch and Faw are not alone in feeling shat a@ust- s 
nd social skills are improved by student-centered methods. 
wch of Gibb and Gibb (8) indicates that students from 
classes which possessed many of the characteristics 
led ‘student-centered’ actually produced a growth in 
in experimental situations outside the classroom. Kelley E 
ne (4) found an increase of empathy in classes which 
taught by student-centered methods. Efforts to produce 

in instructor-centered classes have been unsuccessful. 
we cannot test every possible interrelationship of the 
volved in student-centered classes, but many can be 
pendently of the total method. For example, the vari- 
ee of student verbal participation in the classroom could 
from goals, cohesion, and other variables, and concep- 
5 providing an opportunity of student responses to be 
‘or punished. We might gain some insight into student- 
teaching by varying opportunities for verbal participation 
tying the percentage of comments which the instructor 
corrects. 
variable upon which we already have data is the degree 
the student feels able to influence his own fate. It seems 
t that almost every one who tries student-centered teach- 
finds that the problem of grades presents the greatest 
success. In addition two studies (3) (7) which produce 
change as ayresult of student-centered teaching both 
he power of the instructor by giving the group responsi- 
determining grades. 
m research at the University of Michigan supports our 
e student’s feeling of freedom is an important vari- 
earlier paper (6), the writer reported that students made 

on examinations when they were given the oppor- 
ite comments about test items. More recent experi- 
dicates that the effect of this opportunity is reduced 
nt is directed to write comments. Apparently the im-@ 
hg is that the student feel free to do so if hg "W to. 


\ 
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In addition, it was found in another study that students pre- 
ferred a directive method of teaching which made clear what the 
student had to do in order to pass the course. Again, we would in- 
terpret this as meaning that the student felt better able to deter- 
mine, his fate. : 

ed # in this way, the class in which the instructor insists 
on being non-directive may actually increase the student’s feeling 
of helplessness since he doesn’t know what to do in order to achieve | 
the goal of a good grade in the course. In this situation the student 
may simply perceive the instructor as using his power to block the _ 
normal pathways to the goal. 

From this theory one would predict that the effect of instructor 
permissiveness would depend upon whether or not the group pos f 
sessed the skills necessary to achieve their goals. In a new group 
the effect of instructor permissiveness may depend upon the pres- — 
ence or absence of individuals in the group who have had previous 
experience in working in democratic groups. If the instructor re- 
tains control of rewards, permissiveness with respect to means to 
the goal (such as assignments, classroom activities, etc.) may sim- 
ply increase the ambiguity of the situation for the student and 
reduce student learning. 

Obviously such speculations need to be tested. To make our 
research productive we need to sharpen both our theories and our 
methodology. 
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| technique is a device whereby bi-polar trait 
) of children may be obtained by using the mem- 
é group as the raters. It appears to be one of the e 
tools which yield information regarding the way 
‘a large adolescent group is perceived by this group. 
assumptions and scoring methods have been used by 
i the result that some ambiguity exists regarding 
n of scores yielded by this device. It was the aim 
‘develop a new scoring formula and to evaluate it 
th the two formulas previously applied when the 
que is used for certain types of traits at the 


RING METHODS OF GUESS-WHO RATINGS 


{form of this device, one or more pairs of statements 
trait, one statement of each pair representing 
g on what is arbitrarily designated as the positive 
it continuum, the other statement on the negative 
‘asked to indicate those members of the group who 
exhibit the behavior described by each statement. 
p. 221-36) used the following scoring method, re- 
ula A, for such ratings: 
— = (No. of positive mentions) — (No. 
ntions). 
uess-who ratings are obtained from adolescents, 
p is of necessity large, there is a wide varia- 
erent members are known by the raters. Con- 
A may give excessively extreme scores for 
are widely known and scores which are probably 
to for those who are not. In most cases this factor of @ 
is not related to the trait in question. 
deficiency Havighurst and Taba (8, P k ea 
i 151 4 
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what will be referred to as Formula B: 
Formula B.—Trait score = 


Q 
No. of positive mentions 
Total No. of pos. and neg. mentions on the trait 


X 100 


® But this “Formula appears to overcompensate, since it assigns 
the maximum score to each individual who receives only positive 
mentions regardless of their number. Furthermore, it requires 
discarding all cases which receive no mentions on the trait; many 
such'persons may be well known but their behavior may be judged 
consistently by all raters to fall between the two statements. 

Faced with such problems of scoring, Kuhlen and Lee (4) de- 
vised what amounted to two categories, one containing all scores 
with a greater number of positive mentions and the other consist- 
ing of all remaining scores. Bonney (1) translated the guess-wh0 
ratings to an arbitrary five-point scale. But such methods offer no 
adequate solution. 

To attack this problem of scoring guess-who ratings obtained 
from large populations, let us assume that each pair of statements 
for a trait represents behavior falling beyond two points at ap- 
proximately equal distances in opposite directions from the mean 
on the trait continuum. We may then assume that the number of 
mentions a person receives on a statement is equal to the product 
of two variables: (1) the amount of deviation shown by the indi- 
vidual on the trait in the direction indicated by the statement, and 
(2) a measure of how widely known the person is, that is, the num- 
ber of raters who are acquainted with him. As an estimate of this 
second variable one may use the number of times an individual i$ 
mentioned by the raters on a wide variety of unrelated traits. BY 
algebraically adding such deviations, after transposition, we ob- 
tain: 

Formula C-—Trait score = 


(No. of pos. mentions) — (No. of neg. mentions) 100 
Total No. of mentions on a variety of traits 
This new formula, Formula C, is applicable only to traits rated 


g” an instrument such as Tryon (5) developed in which a esr 
of trait ratings are obtained. It essentially yields trait scores W 


, 7° 
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d if all individuals being rated were mentioned 
aber of times on the entire set of traits. It would ap- 
provement over Formula A, since it weights the 
entions in inverse proportion to how widely known an _ 
gives a score, as Formula B does not, to an indi- 
not mentioned on the trait provided he is ment ned 
‘An empirical comparison of Formula C with each of % 
o formulas is therefore in order. 


STATEMENT OF PROBLEM 


purpose of the present study to compare thé pro- @ 
wula C with Formula A and with Formula B, as pre- 
ned, on two traits judged to be independent of this 
widely known, when the guess-who technique is 
secondary-school pupils. The bases of comparison were: 
‘distribution yielded, (2) reliability of scores, and (3) 
ores as measured by an independent criterion. 


SUBJECTS 


ho ratings were obtained from one hundred and ninety- 
3 and one hundred and ninety-six boys out of the four 
1 an thirty-six members of the B11 class in a senior high 
ited in a largely residential section of Los Angeles. These 
@ asked to rate the members of this class who were of 
sex, The group for whom ratings were scored consisted 
hundred and seventy girls and one hundred and eighty- 
ho had been enrolled a full school year. 


THE GUESS-WHO DEVICE USED 


forms of the guess-who rating blank were constructed, one 
One pair of statements was used for each of twelve 
any of which were taken without major change from Tryon 
traits may be described by indicating, in the order in 
y were listed on the rating blank, the opposing statements 
talkative—silent; old acting—young acting; friendly— 
likes school work—dislikes school work; considerate— 
‘ate; popular—unpopular (with opposite sex); not persist- 
ent; weleomed—ignored (by same sex); puts studies 
studies first; conceited—not conceited; *heerful—sad@ 
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influential—not influential (girls form only); athletically compe- 
tent—athletically incompetent (boys form only). Although, for the 
requirements of Formula C, traits could have been selected for the 
instrument which showed greater independence of each other, the 
variety appeared adequate. 

ð 


8 * TRAITS SELECTED FOR SPECIAL STUDY 


Comparative data for the three formulas were obtained from the 
two traits ‘likes schoolwork’ and ‘puts studies first’ which are thus 
indicated by using only the statement assigned positive values, 

SW The ‘wording of the statements for these two traits on the boys 
form, with appropriate changes for the girls in parentheses, was a8 
follows: 

Here is someone who likes schoolwork; he (she) usually enjoys 
what he (she) does in his (her) classes: 

Here is someone who dislikes schoolwork; he (she) usually detests 
what he (she) does in his (her) classes: 

Here is someone who will not do other things until he (she) 
finishes his (her) studying: 

Here is someone who always spends his (her) time on other 
things even though he (she) has studying to do: 


ADMINISTRATION AND SCORING PROCEDURE 


In giving their ratings pupils were instructed not to sign their 
names and to indicate by means of pre-assigned code numbers 
those individuals they wished to nominate for each statement. 
The number of self-mentions, discovered by an invisible fluorescent 
ink device, was so small that they were not excluded in the scoring. 
The number of mentions received by the one hundred and seventy 
girls in the study on all twelve traits ranged from 5 to 375 with 4 
mean of 68 and a standard deviation of 48. For the one hundred 
and eight-seven boys these scores ranged from 7 to 379 with a mean 
of 59 and a standard deviation of 45. f 

No Formula C scores were given to those pupils who received 
less than twenty-four mentions for the entire set of twelve traits. 
No Formula B scores were given to those pupils who received less 
than two mentions for the trait in question. Comparisons betwee? 
Formula C and each of the other two scoring methods were made 

Rn the population of those individuals who received scores from 
both B a being compared. ; 


rf 
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RESULTS 

ne for each of the two traits described earlier 
rately, were obtained for each formula. The 
and Formula C ranged from —1 to +1, with 
ranging from 5 to 11. For Formula B, the 
43 to 56 with all standard deviatidns ap; xi- Q 
ations between Formula A and Formula C scores 

7 to .82; those between Formula B and Formula C 
om .84 to .89. Since these scores were obtained from 
atings, it is obvious that the formulas are yielding 

b results. Ka 
istribution —All four of the distributions for Form- 

hly leptokurtic. On the basis of the chi-square test, 
‘of normality could be rejected for each of these far 
level. On the other hand all four of the distribu- 

a B were definitely bi-modal. Since about half of 

> ocated at the extremes no test of normality was 
‘Havighurst and Taba used several pairs of state- 

h trait, the bi-modal distributions usually found for 
ere taken as a sign of the reliability of the trait 


+3 


ae STANDARD SCORES 
ution of Formula A Scores on ‘Likes Schoolwork’ for 10s 
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Fre. 2. Distribution of Formula B Scores on ‘Likes Schoolwork’ for 140 
Boys. 


FREQUENCY 
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Fia. 3. Distribution of Formula C Scores on ‘Likes Schoolwork’ for 165 
Boys. 


01 level. This last distribution is shown in Fig. 3 with the Er 

sponding distributions from the other two formulas in Fig. 1 an 

Fig. 2. We may conclude that Formula C is the only one of the 

three methods of scoring which yielded distributions approzima mi 

normality on the two traits selected for study. Tests on most 0 
Sihe remaining ten traits in this instrument showed in every ¢25° 
that Y new formula produced normal distributions. 
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The split-half reliabilities for the two traits for 

5 and .85 for the girls and .95 and .92 for the 
not possible to use the split-half method with 
C since the total scores are not merely the sums 
sbtained from two groups of raters. Instead, since 
ficients between the two traits on Formula Agyor- 
ation were .98 for the girls and .99 for the boys, $ 
‘the two traits were sufficiently similar to be looked 
forms of a rating device each of which was measuring 
nee behavior with respect to schoolwork. Hence 
‘between the two traits were taken as estimates of @@ 
s offered by the different scoring methods. Such a 
3 slightly spurious results for Formula C because 
scores constitute indexes with the same denominator 
when the denominator (total number of mentions) 
out none of the correlations were reduced by more 


the reliability coefficients for each of the two form- 
pared were obtained from identical populations as 


JMPARISONS OF RELIABILITIES OF SCORES YIELDED BY 
. BAsED oN CoRRELATIONS Between Two TRAITS, 


Diff. bet. 
N r correspond- op* D/op 
ing 2’s 
165 -93 52 Al 4.67** 
165 73 
113 74 
113 -80 14 14 1.04 
161 -86 -38 ll 3.41** 
161 73 
110 -68 
110 -79 -26 14 1.90 


es reported have been rounded off after computation to 


ithout considering a correlational term. € 


ant at .01 level g L 
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indicated by identical N’s, The reliabilities given by Formula A are 
higher, both differences being significant at the .01 level, than 
those given by Formula C. This may be attributed in part to the 
highly leptokurtic nature of the distributions’ yielded by the first 
formula, thus allowing a few scores to contribute disproportionately — 
to the covariance. Formula C gives estimated reliabilities which are 


TABLE II.—Comparisons or CORRELATIONS WITH SCHOOL GRADES 
AND SCORES YIELDED BY THREE Formunas. Basen on Two 
TRAITS FOR BOYS AND FOR GIRLS SEPARATELY 


Ko : Scored : Dit. bet, | 
Trait Sex seat | N Shieh orrespond. | Dep 
Likes schoolwork Boys A 165 61 
c 165 71 .18 | 2.78 
B 140 .74 
c 140 .76 .05 :72 
Girls A 161 .61 P 
Cc 161 74 .25 | 8.77" 
B 133 .68 i 
c 133 .78 .22 | 3.17 
Puts studies first Boys A 165 57 x 
C | 165 66 14 | 2.18 
B 126 .63 
Cc 126 .69 .10 | 1.75 
Girls A 161 59 
c 161 .68 .16 | 2.66*" 
B 124 .70 
6) 124 74 09 |15 


Note: All figures reported have been rounded off after computation t0 
two decimal places. 

* All values of op were either .06 or -07. These were computed from oe 
mula including the correlation term giyen by Quinn McNemar, Peyohonri 
cal Statistics, (John Wiley and Sons, New York, 1949), pp. 124-125. T! 

„eight values of r, ranged from .66 to -79. 
$ * Significant at .05 level. 


oF ye at .01 level. 
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led by Formula B, but these differences are 
he basis of the test used. 

he best validity criterion available for the two 
point average during the pupils’ previous year in 
though studying and liking of schoolwork are pre- 
e identified with good marks, there js evidace 
are not as large as might be expected. The 
between each of the two traits, as scored by 
, with grades held constant, are close to zero in- 
ag negative. It would seem likely that the raters 
need in their judgments by school marks pupils 5 
teachers give grades partly on the basis of the 
by the student and completion of daily assign- 

both. 

changed to a numerical scale where, for example, A 
= 0. The mean grade for the girls on this seale was 
da deviation of 2.1. For boys the mean grade was 
dared deviation of 2.4. The correlations between 
cores on both traits for the three formulas are 


the .05 level. We may conclude that, if ries” 
uate criterion, Formula C yields higher validities 
ratings than does Formula A. Comparable data 
} j and Č indicate that in each case Formula © yields 
efficients, but only one of the differences is sig- 


sed formula was the only one of the three scoring 
ded normal distributions for these two traits. 
e no evidence as to how the scores on & trait are 
al distributions are to be preferred for practical 


(ss 
eral periority of Formula C over Formula A with @ 
city would support the assumption, for traits and 


those used in this study, that it is ak 


\ 
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to weight in some manner the number of mentions on a guess-who 
statement in inverse proportion to a measure of how widely known 
the individual is among the rating group. , | 
3) While the use of the total number of mentions on a variety 
of traits has given fairly satisfactory results as an estimate of ex- 
tenggof being known among the raters, a superior measure of this 
® variable might be obtained by having raters check off on a list 
those members of the group whom they know. This would allow 
an investigator to use fewer traits with several pairs of statements | 
for each trait. 


w 


SUMMARY 


A new method for scoring guess-who ratings was proposed for 
those situations at the adolescent level (1) where the population 
being rated is so large that there exists considerable variation in 
how widely known by the group the members are, and (2) where 
ratings are being obtained on bi-polar traits which are judged to be 
relatively independent of this factor of being widely known. When 
guess-who ratings were obtained from a large high-school class on 
two traits dealing with approach-avoidance behavior with respect 
to schoolwork, the proposed formula was judged to be superior to 
each of the other two scoring methods which have been previously 
used. The new method of scoring produced higher validity coeffi- 
cients with school grades as a criterion and was the only formula 
which yielded normal distributions. Some support was given to the 
assumptions made by the new formula. 
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ilness of the mental age as a unit of measurement de- 
on several considerations. One is the extent to which 
n who achieve the same MA score are alike in their mental ge 
his investigation is concerned with one special case of 
namely, the question of whether or not there are sys- 
ferences in performance on intellectual tasks between 
dull children who achieve like MA scores; i.e., like 
ages. Do MA’s of 10, for example, achieved by bright 
-old children differ systematically from MA’s of 10 
dull thirteen-year-old children? Are the MA’s earned 
groups the same MA? Do we need to specify the age of a 
der to interpret his MA score? 
investigators have tried to discover whether there are 
qualitative differences between groups of children of 
ronological ages but of equal mental ages. In most 
‘the groups have been matched for comparison on the 
nental age scores obtained from the administration of the 
-Binet scale. The investigators then identified the items 
anford-Binet, which discriminated between groups of young 
and older children equated on the basis of mental age. 
analysis data thus obtained were then presented as; 
of differences in the mental characteristics of the two 


E 
procedure is open to question for two reasons. In the first 
terion measures are not independent of those used for 
g and matching the groups. The requirement of equality 
| mental age scores necessitates a balancing of differences 
two groups on the individual items of the test. Thus, 
‘one kind of task for one group necessitates superiority 


on the writer’s doctor’s dissertation, State University of Iowa, & 
der the direction of Professors A. N. Hieronymus and J. B. 
( ” 
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in another kind of task for the other group, since the total mental 

age scores for the two groups are equal. The fact that the selection 

and criterion measures are not independent makes it desirable that 

the differences observed in these studies be evaluated by a pro- 

cedure in which the sources of differences are independent of the i 
gm used for selection and matching. 

Tn the second place, it is also questionable whether the matching 
procedure used in these former studies is defensible. When sub- 
jects are matched on the basis of obtained mental age scores, the 
estimated true mental age scores of the older, dull subjects are 

M certain to exceed those of the younger, bright subjects, on the 
average. Thus, if groups are matched on obtained mental age 
scores, one would expect the low IQ group to obtain higher scores, 
on the average, than the high IQ group on intellectual tasks which 
are administered independently of the matching test. 

The present study differs from those previously concerned with 
the problem in that (1) the low and high IQ groups were matched 
on the basis of estimated true mental age scores obtained by re- | 
gressing their obtained scores; (2) tests used in the selection of the 
sample are not used as criterion tests; (3) the criterion tests were 
administered independently of those used for selection and match- 
ing and were more extensive than those used by other investigators. 


RELATED RESEARCH 


One of the earliest investigations of differences between old (dull) 
and young (bright) children having comparable mental ages Was 
reported by Merrill in 1924. She concluded that the retarded 
group tended to excel on those types of items on which maturity, 
life age, and experience were favorable to success. In 1929 Wallin, 
finding the dull superior to the bright in the tasks of Counting 
Backward, and Vocabulary, pointed to age and experience 
determining factors in this success. Laycock and Stanley 10 19 
found the retarded group more successful in reproductive tasks, 
while the bright were superior in memory and reasoning. The an 
year, Rautman reported the bright superior to the dull in ee 
involving manipulation. Rameseshan in 1949 found the bri 
superior on the Primary Mental Abilities of Verbal Meaning 8” 

» Reasoning, whereas the dull were superior on Space. In exa Ma- 
the results of scores made on the California Tests of Mental 
Bi fn in 1950 found the bright superior on delayed reca 


i 
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the dull superior in spacial relations and nu- 


extensive of the studies found dealing with the prob- 
and Magaret (1947 and 1950) found normal 
dren of comparable MA’s in rote memory. In t 
e they reported some evidence to indicie a» 
ility associated with mental deficiency. 


PROCEDURE 


[wo groups of children of approximately equal mental 
different in chronological age and IQ were used «5 
‘groups by levels’ design was employed in the 
ts. The groups consisted of twenty-nine third- 
le children with estimated true Stanford-Binet 
and twenty-nine eighth- and ninth-grade 

ted true 1Q’s of 84 or below. Four six-months 
estimated true mental ages, ranging from ten years, 
s to twelve years, six months, constituted the levels.* 
groups were selected from classes in one mid-western 


pupils who appeared likely to fall into the two groups 
d dull. The Stanford-Binet scale was then administered 
"l ils so selected. Final selection of the subjects in the two 
made upon the basis of the Stanford-Binet scores. 
true scores used in constituting the two groups and 
thin the groups were obtained by the use of standard 
tions. The reliability coefficients (Stanford-Binet yi 
sed in the equations were: .982 for IQ’s below 70; 945 
88; .912 for IQ’s 116 to 129; and .898 for IQ’s 130 
ibjects in each group were assigned to levels on the 
timated true mental age scores which resulted from 
‘of the obtained scores. No further use was made 


study by Bliesmer involving essentially the same samples 

by the writer, was conducted to determine the nature of 

ees in a number of reading abilities between bright and 
ipproximately equal mental age. Emory P. Bliesmer, A & 
and Dull Children of Comparable Mental Ages with 

ous Reading Abilities, Doctor’s Dissertation, State Uni- 

, 1952, pp. 153. ə 


a \ 


a 


D levels in the population. 
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While the samples were not selected so as to control the number 
of boys and girls in each group, the sexes are represented in ap- 
proximately equal numbers in each group. 

Following is a diagram of the design as employed in the in- 


ier ae à 


Criterion Scores 


MA Level MA (Est’d. True) 
Bright Group Dull Group 
I 12-1 to 12-6 n-7 n-7 
II 11-7 to 12-0 n-10 n-10 
II 11-1 to 11-6 n-7 n-7 
IV 10-7 to 11-0 n-5 n-5 


Measuring Instruments —Used as cfiterion measures were the 
subtests of the Wechsler Intelligencé\Scale for Children, the Benton 
Test of Visual Retention (1945), an original Speed of Symbol 
Copying test, and the Chicago Tests of Primary Mental Abilities for 
ages eleven to seventeen. All of the criterion tests except the latter 
were administered in individual testing sessions. The Symbol 
Copying test was an adaptation of the WISC coding test, in which 
the subject was required only to copy the symbols presented "in 
the top boxes into the bottom boxes. The tests were administered 
by four qualified examiners wh6 worked independently. All of the 
tests were scored and rescored under the supervision of the writer. 
The results are reported in terms of simple raw scores. y 

Analysis of Results. For each criterion variable, the hypothesis 
tested is that there is no difference between the means of the popu- 
lations of which the bright and dull groups are samples. The test 
of this hypotheses is the ratio of the mean square for groups to the 
mean square for within-cells.* 


? This ratio is a valid test of the hypothesis of equal population D 
only if the following conditions are satisfied: (1) The subgroup samples ^t 
each level are drawn independently at random from corresponding aer 
populations. (2) The distribution of criterion measures in each subpoPp' ia 
tion is normal and all have the same variance. (3) The numbers of oa 
in corresponding subgroups are in the same proportion for all group dine 
this proportion is the same as that among the numbers in the correspon! 


ndomness 


There is no reason to question seriously the assumptions of ral de finitely 


P and homogeneity. The third condition listed above is 


4 
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ye per cent coefficient of risk was selected in advance of the 
ean square ratios were considered significant if they 
ceeded,the five per cent value in theF table. Through- 
e investigation differences are reported as either significant or 
mificant at the five per cent level of confidence. b 


RESULTS 


of the differences between means of the groups for 
ts is given in Table I. In most cases the differences 
and not statistically significant, although some are 


up was superior to the bright on the WISC subtests 
nsion and Coding, on the Symbol Copying test scored 
and on the PMA subtest, Number. The bright group 
or to the dull on the WISC subtest Digits, and on the 
opying test scored for accuracy. 
suggested that the superiority of the dull group on the PMA 
‘umber was due to the fact that this group, being made 
ghth- and ninth-grade pupils, had received instruction in 
ithmetic operations, specifically multiplication, to which 
of the bright children, who were still in the third 
grades, had not been introduced. Thus, the superior 
ance of the dull group appears to reflect no qualitative 
difference, but rather, a curricular one. The results 
Por ypothesis of no difference between the two groups in the 
ty mental abilities. 
à investigators have pointed to a superiority of the dull 


the present study since there is no possible distribution of 
subjects from level to level which can simultaneously be repre- 
Ree cnally of the mental age levels in the populations of 
dull subjects from which the samples were drawn. This lack of 
ativeness is of consequence only in the presence of a groups by 
ion. There was little or no apriori basis for suspecting inter- 
nt group differences from level to level) in the present study, 
ince each of the groups is fairly homogeneous with respect to 
Age, and IQ. The significance of the groups by levels interaction 
din each of the analyses by computing the ratio of the mean square 
on to the mean square for within cells. In the event of a sig- 
‘action the generalizing of the test of main effects (total-group «© 
uld be restricted to hypothetical populations showing the 
ons of subjects in the mental age levels as are found in the 
ne of the interactions were significant at the five per cl ive. 


® 
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Taste I.—Summary or DIFFERENCES Between THB Raw Score Means 
OF THE BRIGHT AND DULL GROUPS FOR ALL CRITERION TESTS 


Mg Mp |Ms |-| Mp F 

WIS 
'ormation 14.17 | 14.07| + 10] 0 
Comprehension 12.24 | 13.90) 1.66 | 9.69* 
Arithmetic 9.31 | 10.21| — .90 | 3.94 
Similarities 11.55 | 12.03| — .48 | 59 
Vocabulary 34.90 | 34.24| + .66 | 40 
Digits 9.93 | 9.00] + .93 | 8.85" 
Pict. Comp. 11.97 | 11.28) + -69 .93 
Pict. Arrange. 33.35 | 31.48| + 1.87 .99 
Blk. Design 22.48 | 23.07] — .59 | 05 
Obj. Assemb. 21.52 | 22.48| — .96 4 
Coding 36.10 | 948.55] — 12.45 | 30,63" 
Benton 12.28 | 11.83] + 45 98 
Symbol Speedt 91.45 | 141.07) — 50.62 | 29.65* 
Symbol Accur. 91.65 | 87.28] + 5.37 | 9.54* 

PMA 
Number 15.39 | 44,12} — 28.73 | 32.90" 
Verbal 30.23 | 30.46) — .23 | 0l 
Space 15.42 | 15.69] — .27 | Ol 
Word Fluency 33.23 | 33.35] — 12} _ Ol 
Reasoning 14.23 | 16.65] — 2.42 | 1,92 
Memory 7.46 | 6.88) + 58 | 38 


* Significant at the 5% level of confidence. 
t Symbol Speed scores are complements of time scores. 


in tasks involving manual manipulation skills. The results of this 
investigation indicate a significant superiority of the dull group ™ 
performance on the WISC subtest Coding, arid on the speed pri 
of the Symbol Copying test. However, while it is true that the du 

group did better than the bright on the speed part of the Symb 

Copying test, when the test was scored for accuracy the bright 
group was superior. It is reasonable to believe that careful attendi 
to neatness and accuracy, generally emphasized in the ar 
elementary grades, may have slowed down the speed of the be 
group to a significant extent, thus leaving the relative superiority 
of the two groups on this subtest something of a question. 18C 

Mo)pover, the performance of the two groups on the 
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on, Picture Arrangement, Block Design and 
was not significantly different. These tests go 
Narang part of the WISC. It would be ex- 

ull group were really superior in manual manipu- 
‘ould be expected to be somewhat consistently 
bright group on tests which ostensibly requiré@uch © 
ual manipulative skills; performance on which 
criteria of intelligence, and those performances on 
not. It is suggested here that if the dull older children 

the bright younger children on certain manipulative 

se tasks may not be good intelligence tests in the € 


ance of the dull group on the WISC subtest Com- 
s significantly superior to that of the bright. An 
of the kinds of questions which are included in this 
ests that successful answers depend primarily upon 
example; Why is it better to build a house of 
of wood?). 
ng that the bright group was superior to the dull in 
mance on the WISC subtest Digits supports the writings 
es igators who have reported a rote memory superiority 
ght groups. However, there is reason to doubt that they 
al superiority in memory ability since the small 
in favor of the bright group on the PMA subtest 
‘on the Benton test were not significant. 
of the present study support to a very considerable 
generality of the mental age concept. No evidence was 
the claims of some writers that the bright are 
mental tasks as vocabulary, reasoning, and a 
y ability. Nor does the evidence from the study 
flority of the dull older children in performance tasks 
ipulation. 
this investigation has failed to reveal anything very 
use of the Stanford-Binet MA as a measure of level 
opment. 
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3 being made in formulating the relationships veen g 
onging to different areas of psychological investiga- 
hy 7). Miller and Dollard (2) have discussed the effect 
on upon ideation from the standpoint of learning theory. 
) have reported data on the influence of affect on pef- E 


on ith which the present research is concerned is the 
ain kinds of feeling-tone in the classroom learning situ- 
‘applicational transfer’ of learning. An achievement 
structed for the purpose of measuring the student’s 
‘Use information for solving problems in life-like situa- 
ie position taken is that possession of information does not 
imply ability to use that information, but rather that 
y is partially dependent upon the amount and quality 
involyement in the learning situation and upon the 
y structure of the learner. The extent to which products 
ing have been inhibited—one dimension of personality 
he subject of this paper. 

ic hypothesis to be tested is that limited ability to 
principles in problem-solving is, in part, a function 
to which previously learned words and ideas have 
bited or blocked. Applicational transfer is defined as a 
which relevant concepts are brought to conscious ap- 
ugh the operation of necessary and sufficient generali- 
n the attempt to solve a problem. Inhibition as used herein 
: ‘interference with the associative reaction in the normal 
lon of a ‘cue-producing’ response (2: p- 322), with a conse- 
ction in efficient ideation. The cue-producing response 
blocked is viewed as capable of arousing feelings of fear 
ion of the writer’s doctoral dissertation entitled Applicational: y 
Nature, Measurement and Improvement sponsored by M.D 4€ 


University. The writer is indebted to Professors J.E. Hoch- 
A Ronald Lippitt and E. Lowell Kelly for critical reading 
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or anxiety as a result of previous learning? Symptoms of inter- 
ference with the associative reaction, according to Rapaport (8: 
p. 9), are delay or absence of response, or repetition of the stimulus 
word in a word-association test. i 
The generalization process underlying applicational transfer is 
congjdered dependent for its sufficiency not only upon breadth of 
relevant meaningful experiences, but also upon relative absence of 
interference with necessary cue-producing responses. This view 
finds indirect support in Razran’s report of studies in semantic 
generalization: “. . . generalization [of conditioned stimuli—single 
S words] to the other word categories [synonyms, antonyms, etc.) 
was the greater .. . the faster their reaction time in controlled as- 
sociation tests” (9: p. 355), whereas, as noted above, interference 
is characterized by delay or absence of reaction in a word associa- 
tion test. 

The word association test was selected as a method appropriate 
for the measurement of inhibition as defined. An index of applica- 
tional transfer ability was provided by the differential between 
two scores on an achievement test in general psychology, reported 
elsewhere (11). The test consists of two parts, one purporting to 
Measure information, the other, application, with each having 
eighty multiple-choice questions, The application part is composed 


of five reading selections concerning mental hygiene, learning, 
marital adjustment, all in story or anecdotal form, and two reports 
of experiments, followed by questions requiring identification of 8 
principle, example given, of an example with principle given, of 
facts crucial to the solution of a problem, and others. ; 
The problem is whether superior appliers and inferior appliers 
show a similar differentiation in respect to latency on a word a880- 
ciation test, i.e., do superior appliers show brief latency and in- 
ferior ones delayed latency as might be predicted from the fore- 
going analysis. 
SUBJECTS 
Ss were fifteen introductory psychology students, each assigned 
to one of two groups on the basis of having a differential between 
his score on the application section of an achievement test 0 


> * Dollard and Miller refer to this interference as ‘repression’ (2: P- on ‘ 
but the process, viewed empirically, also appears to fit O. H. Mow 
formulfftion of,the ‘anticipatory response’ (6: p. 181). 
9 
9 


ina 


|Achieve-| 
Word | ment | Applica-|Informa-| Differ- 
Fluency | Test tion tion ence 
(total) 


19.1 | 67.7 | 75.3 | 33.1 | 42.2 | 9.1 
19.0 | 48.0 | 72.0| 24.0 | 48.0 | £0 o 


9.7 3.3 | 9.1 5.8 | 14.9 
124.) — -05 10 01 


bove or below the mean of the group differential. 
ers’ (N = 6) scored, at the mean, 24 points 
ication test than on the information test. (See 
erior appliers’ (N = 9) scored, at the mean, 9.1 points 
pplication than on the information test. 

extreme differentials, there was no significant differ- 
n the groups in total score on the achievement test in 
the Weschsler-Bellevue information subtest, 


are virtually identical. 


APPARATUS AND PROCEDURE 


‘lation test has traditionally been used as an aid 
bition (8: p. 35), conflict (3) or emotional dis- 
363). Although qualitative analysis of responses 
| with individual cases, reaction time has yielded 
ts for groups (3, 10). One criterion of inhibition 
atency greater than that of normal subjects. Gardner 
à mean reaction time to ‘neutral’ words (e.g., tree, lamp, 
f 1.85 sec. for normal college students. 
fifty-seven words was adapted from a list of ‘emo- 
and ‘neutral’ words, so termed on the basis of 
Universal agreement among competent judges’, re- 
(10: pp. 109-110). The words were chosen for 
e in several problem areas, Viz., academic (study, 
alone, friend), family (mother, father), and religious 
These were arranged with an attempt to alternate 


tral words, and were transcribed at 20aec. — 
© 


Q 


172 The Journal of Educational Psychology 


on an Ampro Tape Recorder. The recording was utilized in order 
to increase the objectivity of the testing situation. Alternation 
with neutral words and use of intervals were introduced for the 
purpose of reducing the effect of possible perseveration due to 
emotional arousal. . 
S% who were tested ‘blind’ in a standard situation, were in- 
o structed to respond orally with the first word which came to mind 
after each stimulus word was heard, to respond to all words, and 
to attempt to keep the mind blank between words. Latency was 
recorded in fifths of a second by means of a stop-watch. 


RESULTS 


Mean seconds for each group are reported in Table II. The re- 
sults clearly indicate differences favoring the superior applier group. 
All differences are significant beyond either the .01 or .02 level of 
confidence. Areas yielding the largest differences are religious and 
social; those yielding smallest differences, neutral and family. 


DISCUSSION 


It was predicted that, if the process of applicational transfer is 
influenced by inhibition of concepts involved in past learning, 
‘superior appliers’ should show a lesser degree of inhibition, a8 
determined by reaction time on a word association test, than should 
‘inferior appliers’, The predicted difference is clearly indicated. 

Individual results are congruent with group means with the ex- 
ception of one § in the inferior group who obtained mean latency 
below two seconds for all areas. Analysis of his test answer sheet 


TABLE II.—DIFFERENTIAL Latency or SUPERIOR AND INFERIOR 
Appiiprs on A WORD ASSOCIATION TEST 


(mean seconds) 


Related Area 
Group N 7 
Academic | Social | Family | Religious] Neutral | Total 
Superior 9 | 1.66 | 1.84 | 1.25 | 1.44 | 1.56 a 
Inferior 6 2.86 | 3.35 | 2.18 | 3.34 | 2.34 | 2 
Difference 1.20 | 1.51 .93 1.90 .78 ro 
Dt 2.93 | 3.26 | 4.90 | 3.43 | 2.87 a 
ue 2M OU OL, | ).01 | 02): 


ae 
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i he failed to attempt twenty-eight of the eighty appli- 
because, he subsequently stated, he was overly cau- 
t part. Had he guessed on all omitted items, his appli- 
. by chance alone, might have been such that he would 
selected as a subject. 

(8: p. 17) suggests that a latency of four secondg be 
d indicative of a disturbed response. Responses of in- G 
include an average of 8.0 disturbed responses per 
those of the superior group include an average of 
onses. 

explanations may be offered. First, if reaction time g 
the inferior group, it is possible that they did not com- 
ievement test. If so, the differential between applica- 
nformation scores might not be indicative of lesser appli- 
ity. The application section was administered first, 
d was completed by all well within the time limit. 

, the inferior group may have been more cautious, thus 
ore items and lowering the application score. While this 
might be of interest, an item count indicates that, with 
tion of the one S discussed above, the poor appliers 
less than half as many items per S as did the superior 
possible significance of this result has not been explored. 
some unconscious bias on the part of the investigator, 
nt slowing of his own reaction time occurred even 
vestigator was unaware of the group to which a sub- 
. Analysis of individual record sheets indicates that 
ne of the inferior appliers had some words with latency 
ble in brevity to those of the high appliers. Moreover, 
t of the inferior group yielded a latency profile equiva- 
irevity to those of the superior group except for one dis- 
Area, While not conclusive, these factors suggest the un- 
of such an alternative. © 


SUMMARY 


apt has been made to define ‘applicational transfer’ and 

$ hypothesis that limited ability to apply principles in 

solving is, in part, a function of the extent to which words 
inhibited or blocked. Fifteen Ss were assigned to two € 

he basis either of a large or a small differential between 

On scores and information scores on a previously y 


(9 
© 


investigation, applicational transfer and emotional disturbance, 


ao 
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achievement test in psychology. The groups were shown to be | 
approximately equated in certain other respects. With greater than _ 
normal latency on a controlled association test as the criterion of 
inhibition, superior appliers were found to have significantly shorter _ 
reaction time and fewer disturbed responses than inferior appliers, 
Altgrnative explanations are discussed. 

relationship between variables belonging to different areas of 


has been suggested. 
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tion exists between degree of learning and delay of 

“is well known. However, most experimenters in 
allowed the ‘response’ to be made prior to the ee 
of reinforcement. Thus, the gradient which is 

is the backward one; this means simply that the 

cedes the reinforcement. However, the results of 

er, Dove, and Bernstone (10), Jenkins (7,8, 9), Thomp- 

Jos e (12), suggest that a gradient of delay also exists in a 
ection; that is, when reinforcement precedes the 


ion exists between amount of learning and immediacy 
nt is well known. However, in most studies of this 
) the experimenter has allowed the ‘response’ to be 
to the administration of reinforcement after a specified 
ne. Thus, the gradient of delay which is usually studied 
ard one; this means simply that the response precedes 
ent. On the other hand, the results of Muenzinger, 
mnstone (10), Jenkins (7, 8, 9), Thompson and Dove 
S that a gradient of delay appears to exist in a ‘for- 
fection; that is, with the reinforcement preceding the re- 
ony , 
counterpart of a reinforcing state of affairs preceding 


junction is apparent in many classroom situations, it 
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appears desirable to extend our notions on this point at the human 


level. 

Hull (6) points out that the significance, if any, of ‘forward’ delay 
in reinforcement has not yet been carefully explored, but he is of 
the opinion that since the reduction of a need necessarily follows 
rater than precedes the act, it would seem that the ‘forward’ de- 
lay could hardly play an important rôle in selective learning. The 
problem, then, was to test the prediction that differential perform- 
ance would result from delayed and immediate reinforcement, when 


‘reinforcement precedes the response in a group situation. 


> 


PROCEDURE 


The subjects were sixty-four third-grade pupils at Central ` 


Elementary School in Martinsville, Indiana. Groups I and II were 
composed of two independent third-grade classes. Day 1 was de- 
voted to obtaining an initial measure of ability for the two groups 
in a routine computational task. Group I was verbally reinforced 
on Days 2-5 immediately before being given the task, while Group 
II was required to wait one and one-half hours after receiving rein- 
forcement before the task was given. On Days 6-8, conditions were 
reversed. Group II received the task immediately after reinforee- 
ment, while the Group I delay between reinforcement and task 
was one and one-half hours. 

The reinforcement was similar to that used by Auble and Mech 
(1, 2) in previous classroom investigations. The teachers were re- 
quired to adhere to the following verbal reinforcement throughout 
the experimental sessions: “You children are doing very nice work 
on the problems and are improving rapidly. Let me see how much 
better you can do today. Try to avoid any careless mistakes, and 
try to do as many problems as you can.” 

The assumption, of course, is that the verbal stimuli from the 
teacher, designated as belonging to a class ‘reinforcing’, will in- 
crease the probability that a particular class of response will occur 
more frequently than if these stimuli were withheld. 

The reinforcement was administered at approximately the same 
time each day. However, the task was either given immediately 
after, or delayed one and one-half hours in accordance with the 
operations previously described. 

All $’s were required to participate in the task as part of the class- 
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, so far as can be determined, were naive with 
that an experiment was being carried out. 
in fhe main, a routine arithmetical one consisting 

er combinations that were to be added or sub- 
was allowed five minutes a day to work the prob- 
enough problems, however, so that the Onost @ hi 
not able to finish before the time limit. Teachers 
to avoid remedial work dealing with the task 
‘experiment. 
RESULTS AND DISCUSSION Pst 


s the cumulated transformed correct responses for 
under both delay conditions. It should be noted 
root transformation was applied to the raw data. 
me in accordance with a recommendation by Bartlett 
e of the proportionality of the successive means and 
well as heterogeneity of variance. 
of covariance was carried out between the trans- 
correct responses and the mean cumulated trans- 


4 
DAYS 
cumulated transformed correct responses for two groups 
2-5 Group I received no delay, while for Group II a one and 
delay existed between reinforcement and task. Delay con- 
s 6-8 were reversed for the two groups. (Group N = 32). 
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formed correct responses on Day 5. The obtained F value of 1.75 
was not statistically significant at the .05 level of confidence. The 
meaning of this insignificant value of F is that it indicates simply 
that the differences in the cumulated means of the groups of Day 
5 can be accounted for by differences in mean level of initial ability 
as n®asured in the pre-test trial on Day 1; for as Fisher (4) points 
out the means of the groups on Day 5 have been ‘adjusted’ by the 
analysis to a common mean initial level of performance on Day 1. 

Similarly, a covariance analysis was computed between the Day 
5 and Day 8 mean cumulated transformed correct responses. This 
operation was carried out in order to test for differences when delay 
in reinforcement was reversed from Group II to Group I. The ob- 
tained F value of 1.67 was again not statistically significant, indi- 
cating that whatever difference existed on Day 8 could not be at- 
tributed to the delay in reinforcement. A glance at Figure 1 shows 
that Group I maintained a relatively superior rank of correct 
responding over the seven sessions and the delay in reinforcement 
during Days 6-8 clearly did not exert any suppressing effects. 

A similar analysis was carried out for the error responses. Figure 
2 shows Group I and II errors under the two conditions of delay 
for a seven-day period. A covariance analysis between the initial 
measure and Day 5 indicated that the group difference was not 
significant (F < 1). Finally, the covariance F value of 3.20 that 
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Fig. 2. Mean cumulated transformed error responses for two groups of 
S’s. On Days 2-5 Group I received no delay between reinforcement i? 
task, while for Group II a one and one-half hour delay existed. On Day 
6-8 delay conditions were reversed for the groups. (Group N = 32). 
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Jassroom or group data exist, at present, with 
e cited results. The data, however, indicate 
ferences existed between the groups that can be 
delay between reinforcement and task. These è 
patible with the results of previous investigations 
h utilized infra-human organisms and, especially so 
with Jenkins (7, 8, 9) whose data strongly suggest 
of delay exists when reinforcement precedes the 
ore specific comparison might be made betwee: 
rollary iii and the obtained data in the present study. 
ollary iii B states as follows: “The greater the delay 
‘of the incentive by groups of learning subjects, the 
be the resulting learned reaction potential ...” (6, p. 


ers to his treatise as a general behavior system, 
infra-human organisms, the ‘deduction’ to be made 
corollary is that if each subject on each trial under- 
ys of the experiment, performance or reaction po- 
be weaker as delay is increased. The present empirical 

not correspond to this prediction. Such a result, 
pears to bear out Hull’s statement that the corollary 
n has yet to be verified in ‘detail’. 

it review Stroud (11) makes a cogent case for pursuing 
of experimentation to clarify certain notions with re- 
or theory for education. Certainly animal experi- 
ad theory construction have made rapid progress dur- 
decade, presenting the educational psychologist with 
ipts that appear to be testable under classroom condi- 
‘such concept has been tested in a classroom situation. 
e design was relatively simple, it should be noted that 
results were obtained from the two independent class- 


t behavior ‘postulates’. There is no reason, then, why 
m for education should not basically develop frome 
and problems of the classroom. 
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SUMMARY 


The present study began with the belief that progress could be 
made toward evolving a behavior system for education by virtue 
of testing certain concepts derived from learning theory. Specifi- 
oe the question to be answered in the present experiment was: 

® Would differential performance be obtained in_a specified task 
under two varying conditions of delay in verbal reinforcement? The 
deduction to be made from existing experimental literature on the 
infra-human level is simply that as delay in reinforcement is in- 

Gm, creased response strength in a specified task should decrease. 

To test whether delayed reinforcement would have less advan- 
tage than immediate reinforcement in a classroom situation, the 
following procedure was used: Two independent third-grade class- 
room groups composed of thirty-two subjects each worked in a rou- 
tine task for five minutes a day for eight days. Day 1 was devoted 
to obtaining an initial measure of ability for the two groups. Group 
I was verbally reinforced by their teacher on Days 2-5 immediately 
before being given the task, while Group II was required to wait 
one and one-half hours after receiving reinforcement before the 
task was given. On Days 6-8, conditions were reversed. Group II 
received the task immediately after reinforcement, while the 
Group I delay between reinforcement and task was one and one- 
half hours. 

The covariance analyses for both error and correct responses 
indicated that no statistically significant differences existed be- 
tween groups throughout the experiment that could be attributed 
to the delay between reinforcement and task, The results clearly 
were not compatible with a prediction from a theoretical model 
that was based largely on infra-human data. The study, however, 
is indicative of a future program of intensive classroom experi- 
mentation that if pursued vigorously could contribute materially 
to the construction of a theoretical legging framework for educa- 
tion. 
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BOOK REVIEWS 


Heren M. Rosrnson, Ed. Clinical Studies in,Reading IT. Suppl. 
Educ. Monog. No. 77. Chicago: University of Chicago Press, 
apoa, pp: 189. $3.75. 


In this second report from the Reading Clinic of the University 
of Chicago, emphasis is placed upon visual problems related to 
difficulties in learning to read. The first part deals with treatment 
of poor readers with vision problems. Included are discussions of 


Bz ypes of problems encountered, visual screening tests used, pro- 


cedures for referral, and teaching adaptations. It appears that 
visual difficulties interfere with adequate reading progress in in- 
dividual cases. The incidence of visual problems among poor readers 
is sufficiently high to warrant visual testing of all disability cases 
to identify cases in need of examination by professional refraction- 
ists. Results of such examinations provide information useful in 
remedial teaching. 

Four reports of staff research are given in Part II. These reports 
are concerned with the factors of visual efficiency, personality 
adjustment, eye-hand preference and reversals, and prediction of 
increase in reading rate in relation to progress in learning to read. 
In general, intimate relationship between these factors and reading 
progress was absent. However, pertinent evaluation of the experi- 
mental techniques employed were made with suggestions for fur- 
ther research. 

In Part III appear four reports of research by graduate students. 
Important findings include: (1) A combination of results from 
visual screening tests plus teacher observation of symptoms provide 
a better basis of referral to a refractionist than either alone. (2) 
Girls tend to excel boys in reading through the first three or four 
grades only. (3) Although auditory acuity and auditory discrimi- 
nation do not appear to be widespread causes of inefficient word 
recognition, there is a suggestion that auditory memory Span A 
important. (4) Important factors in leisure reading by retard 
readers include desire to read, availability of books of proper P- 
terest and easy vocabulary, short books, and a scheduled time for 
free reading. ae 

Part IV consists of five papers by leaders in the field on vision 
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ered at a conference on the subject. Jobe’s paper 
ssion of the structure and function of the visual 
emphasizes certain developmental factors in 
The visual problems of poor readers receive 
mtion in an analysis and evaluation by Eames. In 
‘visual training and reading, Eberl emphasizs the 
nplete case study of all visual functions as well as a 
jon for disabled readers. In certain cases, visual 
plement remedial teaching to restructure and 
‘reliable visual behavior pattern possible. Such train- 
eaching to see’ rather than just ‘muscle exercis 
is not clearly understood by many clinicians. 
this monograph points out that the series of experi- 
and discussions do not provide final answers to the 
lered. However, the contributions are important in 
knowledge of the relation of visual defects to read- 
evaluation of experimental and measurement tech- 
posing additional problems for investigation, and in 
it implications of the findings. All workers in remedial 
ll profit by careful reading of this monograph. 
of Minnesota Mines A. TINKER 


Bernarp. Mental Hygiene for Classroom Teachers. 
ik: McGraw Hill Book Company, 1952, pp. 472. 


olume is an outgrowth of fifteen years of experience in 

murses relating mental hygiene to education.” (p. vii). 

in four parts, Part 1, Basic Considerations, discusses 

the mental hygiene point of view; human needs and 

th; the nature of maladjustment; meeting the needs 

and special needs of adolescents. 

Mental Hygiene in the Classroom, deals with teacher 

and pupil behavior; understanding and helping children 
the mental hygiene of discipline; teaching methods 

justment; some questionable school practices; per- 

oblems in the classroom; and constructive classroom 

mental health. £ 

pecial Approaches to Mental Health, includes art asan ` 
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approach to mental health; writing for understanding and release; 
using drama and play; and limitations and precautions regarding 
mental hygiene. o 

Part 4, The Teacher’s Mental Health, discusses a positive view 
of the teaching profession; the teacher’s philosophy—in the school; 
and he teacher’s philosophy—adult mental hygiene. 

“Emphasis is placed on the normal child and on mental hygiene 
in the classroom rather than through the use of clinics, formal 
guidance services, ete.” 

“Improvement in educational practices is based on willingness 

try theoretically sound methods in spite of beliefs that present 
techniques are efficacious. But such experimentation should be 
based on a consistent point of view—there should be a frame of 
reference. The outlining of such a frame of reference is the task 
that the writer has undertaken.” (p. vii). Apparently mental hy- 
giene is the frame of reference. There are techniques that are only 
for the use of experts. The writer is concerned in this volume with 
those which should be known and used by teachers. 

This is not a fundamental text in mental hygiene but an attempt 
to show how mental hygiene can help in the work of the school. 
It is thus primarily a book on education. Wallin is not mentioned, 
although Shaffer (1936) is quoted. References are especially to pub- 
lications concerning teaching methods and teaching aids. 

The spread of subject matter can be seen in subjects discussed 
or references cited: teaching how to write; relations of adminis- 
trators to teachers; problem of aging; grading systems; need for 
exercise; learning; projective technique; use of test data; ventila- 
tion control; visual aids; therapy through art; sociometry and 
pupil adjustment; ‘experimental approaches’ which seem to be 
‘readily available to ordinary classroom teachers’.” Psychodrama 
and sociodrama (Sociodrama is an extension of psychodrama) (p. 
343) are discussed and recommended for the use of the classroom 
teacher. 

The statements, “There is a correct way to rear a child,” and 
“There is a correct way to teach,” (pp. 375, 376) lead us to wonder 
if these have been discovered; but also leave the suspicion that 


, probably there are several ways for each task, and that different 


@ 


ays might be necessary to deal with individual differences. The 
latter are mentioned, grudgingly, in two paragraphs (p. 146). 


B iros given by the book is that many good oe 
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ers; that special techniques are assumed to be 
hers but with questionable foundation and train- 
are dogmatic and imperative statements, not all of 
‘accepted; some of these are constantly repeated 
d in pedagogical literature, and, though trite, some 
bear repeating. - iG) © 
‘and attitude of the book are good; the style is 

į some terms are used loosely, e. g., experimenta- 

ntal; the spread of subject matter suggests the 
which gives orientation but which does not deal 

h any one subject. A. S. Epwarps “<— 


y of Georgia 


Lacey. Statistical Methods in Experimentation. New 
Macmillan Company, 1953, pp. 249. 


text should prove to be an excellent one for a first 

tics, It touches upon experimental design, probabil- 
normal distribution, tests of significance of means and 
between means, Chi-square, correlation and regres- 
lucial limits. Assumptions underlying each statistical 
explicitly stated and justified verbally. The beginner 
by extensive derivation of formulae, an approach 
e desirable wherever classes include many persons with 
background in mathematics. Such topics as sampling 
I the differences between one-tailed and two-tailed tests 
ince, for example, are introduced easily and simply in a 
‘should provide a good foundation for later expansion. 
of exposition is remarkably good. One might wish to 
the discussion of small sample theory, and perhaps 
5 of the Chi-square test, if one’s class moved fast enough 


ples and problems are included. Usually these are 

S of chapters, but some are within the chapters, close to 

cussion. These are in addition to the illustrative ex- 

by Lacey in his discussion, The examples are to be 

' the student, and there are answers given for these, 4 
Some very good explanatory material for each item. 
llow, involving the operations just discussed t the 
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chapter and illustrated and explained in the examples, For the 
convenience of the instructor, answers ought perhaps to be pro. 
vided somewhere for these also, although, they may be easily 
worked out in a short time. The examples and problems are nicely 
distributed over several kinds of research studies, so that applica- 
tio®s of given techniques to various fields are well illustrated, 
Several tables are included, not all in the form usually encoun- 
tered in similar texts. Table B, for example, is an abbreviated 
version of the familiar table of percentages of total area under 
the normal curve between the mean ordinate and other ordinates, 


SAt would require the student to interpolate more frequently than 


» 
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with the fuller table, which might provide an additional hazard 
for the beginner. 

As one who has taught elementary statistics for several years to 
students who have almost no background for the course, the re- 
viewer believes this book comes as close as any he has seen to being 
an ideal teaching text for that kind of class. It may be considered 
too limited in scope by those who have more advanced classes, or 
who wish their courses to cover more ground. For students in 
psychology, education, sociology, and related fields who are limited 
in background, however, this text should provide an excellent in- 
troduction to statistics and experimental design which involves 
very little suffering. Ceci M. FREEBURNE 

Bowling Green (Ohio) State University 


Cuartes KocH. The Tree Test. New York: Grune and Stratton, 
1952, pp. 87. 


Tn this book are presented the interpretations which the author 
has found applicable to elements of the form, location, and grapho- 
logical features of drawings of trees, used as a projective technique: 
For this purpose the subject is requested ‘to draw a fruit tree. 
The author further suggests that the subject draw additional a 
“if one has the impression that the first drawing obtained does n° 
correspond with the person tested or that the drawing differen- 
tiates too little.” He claims that such successive productions tap 
different layers of the psyche. ok, 

The introduction, which comprises the first half of the bo of 
ie of a lyrical and imaginative treatise on the symbolism 
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ea 
by Pees illustrative analyses of test subjects. 
alidity of his interpretations the author states 
e heen proved correct “(1) by the nature of the 

(2) by comparative investigations, by direct 
ng and by judgements of independent per- 
been employed mainly in the vocational gðid- 

and adults, and the author points out with 8 
t additional samples would be needed to extend 
‘tree drawings to special groups such as the 
t or the mentally ill. He also suggests that the 
“conjunction with other tests for diagnosis ant- 
the findings may check and augment each 
te that the author does not see fit to extend 
and critical attitude toward the interpretations 


lective data contained in the book are tables pre- 
velopmental trends for various tree-drawing charac- 
@ age range from five to sixteen years, and a com- 
two groups of the same age but different mental 
rtunately, all statistics are presented as per- 
ication of the sizes of the samples studied. 
of the book contains clear and detailed illus- 
ible variations in the drawings of root, trunk, and 
th lists of interpretations for each detail. The 
t interpretations suggested for any one drawing 
ould give the psychodiagnostician ample leeway 
is imagination in producing a rich character por- 
ject. However, being a skeptic with regard to the 
phological interpretation and the “scientific anal- 
expression,” this reviewer is inclined to wish for 
| more fact. 

y have some value for those who are particularly 
Wing as a projective technique. It would be quite 
Compare the interpretations given here with those 
drawings on the HTP test. It might also be of 
ne whether the same developmental trends ob- 
z as those found by the author for Swiss children. 

7 GoLDINE C. GLESER # 
ersity School of Medicine 
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James L. Murseuy. Using Your Mind Effectively. New York: 
McGraw-Hill Book Company, 1951, pp. 254. 


In spite of the more general implication of the title and the 
author’s protestation against confining the concept of mental effee- 
tivgness to scholastic efficiency, Using Your Mind Effectively can 
be described best as a ‘how-to-study’ book written in popular style, 
The theme upheld throughout the volume is that the achievement \ 
of better thinking is the requisitie for attaining greater adepi 


in studying. Since the desideratum of better thinking is learnable W 


Saad teachable, Mursell’s task is to show the way to the reader. 


The steps in reaching comprehension and understanding are for- 
mulated as: (1) obtaining an over-all view of the material under 
consideration, (2) identifying its main features, and (3) working 
the details into their proper relationship to the main skein. 

After an introductory section on the value of mental efficiency, 
the book is broken down into three main divisions; one dealing 
with an exposition of the general psychological principles to be 
observed in attaining intellectual mastery and two devoted to the 
demonstration of practical applications. In Part One the author 
elaborates on the nature and usefulness of the sequence; picking up 
the main thread, identifying essentials, and relating the details to 
the whole. He discusses the importance of this sequence in the 
reading of textbooks, study of college courses, and the writing of 
term papers and theses. The construction of a mental map inte 
which essential parts and details may be fixed is offered as the 
modus operandi for achieving mental excellence. Applications of 
this technique to non-academic situations are demonstrated, ¢-+ 
preparing a talk, orienting oneself to a new job, arranging one's 
budget, and learning to play tennis. 

In Part Two standard topics of the ‘how-to-study’ books such s$ 
budgeting time, note-taking and note-using, self-testing, and a 
centration are discussed under the rubric of working tools an — 
practical plans, The general rationale for the recommended 4P- 
proaches as well as the minutiae of procedural detail is set forth. — 
Much of this is the stock advice on these topics of ‘how- 
books. The author appears at times to be self-conscious about some 


8 z» of the trivia which he presents. After marshalling a lengthy list 


ð 
© 


materials and equipment that should be kept in the study noe 5 
he f constrained to remark that he offers this for whatever 
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dng should be the governing consideration in the 
of time, sonditions, and methods of work. Everything 
mental to the goal of encouraging thinking; note- 
ew are seen as experiences in thinking. 
e under the title “Some More Extended App ea- 
the following topics: memorizing, reading, writing 
and theses, and creative thinking. Good memorization 
conditional upon proper purposing, understanding, and 
nalyzing, interpreting, and thinking are considered the 
ë development of reading skill. In writing term papi 
nded that the outline and over-all plan be completed 
e word of the final manuscript is written. In a very 
on creative thinking, Mursell makes the point that 
one grasps the meaning of another person’s writing, one 
d in creative thinking. The author makes clear that his 
ering suggestions is not to facilitate the functioning of 
nded workers, but to influence the intellectual worker 
h even mundane tasks as invitations to creative think- 
le concludes his work with the statement that he has tried 
® everything in the book center on creative thinking. 
Finishing this book, the reader might legitimately ask what 
‘Psychology has made, since William James, in offering 
ie to students in their pursuit of academic success. No 
seems able to go beyond or even approach the level of 
Falks to Teachers on Psychology: and to Students on Some 

ideals published in 1899. Basically little that is new since 
How To Use Your Mind appears evident in the more 
books on ‘how to study’. Aside from the Q 3 R technique 

State University for studying chapters in the convention- 
textbook and the ‘push on’ or quick reading method 
foreign languages, there is in the book under review 
by preference to new developments in effective study 
Psychologists who choose to write ‘how-to-study’ books 
feel obliged to approach their topics as exclusively cog- 

ve matters unrelated to the emotions and the total 
. They appear to be oblivious to the psychoanalytic œ 
and to dynamic psychology in general. Mursell fails to & _ 
t the inability to understand and think effectively may 
O emotional factors as much as to poor coum ae in 
U 2 
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cognition. The failure to integrate cognitive functions with 
entirety of the personality accounts for the antediluvian 
of most ‘how-to-study’ books. i. 
Mursell has done a good job of expounding the traditional ap 
proach for developing mental efficiency. However, at times, the 
9 tye is repetitious, long-winded, and too general without the 
pensation of being inspirational—hence boring. The standard: 
writing at times seems to dip below the college level to whieh 
is apparently directed. The attempt to offer applications to ses 
pursuits appears to be simply a gefture. For thos 
simple and very clear exposition of the conventional view 
on effective use of the mind, this book may be most satisfying. 
Pamir M. Krat 
Adelphi College 


C. W. OnELL, How To Improve Classroom Testing. Dubuque, I 
William ©. Brown Company, 1953, pp. 156. 


velopment by taking the objectives of education as the 
of the achievements that are desired. He then concentrates @ 
A ees ns problems of intak 
and personality measurement. Chapters IV through 
are devoted to test construction principles and illustration® 
various types of test or problems; these illustrations 
drawn from a matter fields. The manual rf 
to be a good source of ideas for test item types, and as mw 
should be of value to teachers. One chapter on administratioe 
scoring and one on statistical methods in connection with = 
complete the volume. The chief contribution of the manual i 
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d Co., 1952, pp. 328. 


Greenacre has published many psychoana- 
dies of early personality development. These 
many journals; most of them in the Amerin 
wechiatry or the Psychoanalytic Quarterly, The © 
his volum is to make a selection from these 

more readers. The connecting link for the 
d hat they are all related to the in jon of 

jand special traumas in the first few years co 
jn this volume are ‘The Biological Econ 

psition to Anxiety; Infant Reactions to Re- 
and Weeping; Vision, Headache, and the Halo; 
luro and Superego Development; Conscience in 
A Study of Screen Memories; The Prepuberty 
i Problems of Early Female Sexual Development; 
srporation and the Phallic Phase; and Some Fac- 
Different Types of Genital and Pregenital Organi- 

lume contains a very favorable and informative 
mn by , Ernst Kris, a list of references for each chapter 
ek end ‘of the volume, and an index. 

will meet the unreserved approval of that school of 
‘and clinicians who have an interest in and include 
social as well as individual factors. ‘This group will 

to the content of the volume and the interpretation 
the tame way as Kris docs in his introduction. Kris ` 
studies as a valid elaboration of Freud's 
Ba which enables Greenacre to enlarge her vista 
ychologic: events and to reémphasize the continuity 

physiological reactions of the organism to its 
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primarily an analytic clinician, shows a clear familiarity with and 
knowledge of contributions from related fields as well as psycho- 
analysis. For example, on the physiological Jevel she shows famili- 
arity with the works of Cannon, Morgan, and Coghill. On the 
psychological level she shows familiarity with the writings of such 
costributors as Gesell and Halverson, Mandell Sherman, Sontag, 
o and Jersil H. Meurer 
Psychological Service Center "e 
St. Louis, Missouri 
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University of Michigan 


7 the Psychology Department of the University of . 
n instituted a seminar with academic credit for graduate R 

Sere teaching sections of the introductory course. 
first semester the seminar reviewed the literature on 
sin teaching methods. They then wanted to evaluate, ‘ 
teaching techniques. This experiment was undertaken t 
shop’ project of the seminar. A 
the second semester of 1946-47 there were 865 Taden 
i the elementary general psychology course at the Univer- - 
igan. The course carried three hours’ credit. Each s 
‘one hour a week in a large lecture section and two hdt t 
sections. The students were divided into three lecture _ 
250 to 300) twenty-six class sections (N = 25 to - 

Same lecture was delivered three times on Mondays and 
ys. THe two weekly class meetings were held on Wednes- 
days. The lectures were given by a senior staff member 

“s 

Owing assisted in planning and carrying out this experiment: 
Boyd, Everett W. Bovard, Robert G#Givvy, Theodore N. Hari- 
n H. Hossick, Norton B. Knight, Ernest J. London, Elizabeth © 
» Herbert H. Meyer, Elliot G. Mishler, Harvey Schrier, ham oh 
er, and Philip I. Sperling. 
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and the small section meetings were conducted by psychology 


‘graduate students. 
La 


THE PRIMARY FOCUS OF THE EXPERIMENT 


In common with others who take seriously the task of teaching, 
the .eminar group often found its deliberations ending in disa- 
O greement. We concluded that the primary prerequisite for our work 
was the construction of better instruments to evaluate the outcome h 
“a of our teaching in elementary general psychology. It was assumed 
» that the sensitivity of such instruments might be determined by 
Ca ay, ig discriminate between teachers of varying quality and 
by wnether they reflect differences in teaching procedure. It was 
important to develop external criteria of the efficacy of our teach- — 
ers and to teach sections according to widely divergent methods. 

In order to determine what evaluative measures were needed, it 
Was necessary to decide upon the objectives of the general psychol- 
ogy course. To aid in the definition of these objectives each member 
of the seminar presented his view as to what the objectives of the 
course should be. Two student sections in the general psychology 
course also discussed goals of the course. Results of these discus- 
sions and papers were then considered and the following tentative — 
~ goals were formulated: N 5 
1) Understanding of the facts and principles of psychology basic 

«, for further study of psychology or for use in other areas of life. 4 
4B Ability to apply learning to aid the student’s understanding 

of human behavior in his everyday life. 

3) Development of a scientific and analytic attitude toward hu- 
_ man behavior: (a) Understanding multi-causality. (b) Familiarity 
- with the! experimental approach. (c) Increased skill in formulat- — 
ing conclusions. (d) Increased skill in evaluation of conclusions. Qj 4 
Ability to distinguish between factland opinion. "ei 
4) Awareness of the adaptability of human nature. (a) Recog- 
nition of the limitations of attempts to change adult human nature. 
(b) An attitude of understanding toward individual differences. 
5) An appreciation Öf the value of psychology. 


THE DESIGN OF THR EXPERIMENT 


, The design of the experiment was dictated by a number of od 
ditions. First, of course, was the imperative consideration os 
oy teach the elementary Course in such a way that all 
A pi 


Q A — Via 


ae variety of criterion instruments. A number 
died a-borning; sometimes their reliabilities were 
dequate. Im the end some thirteen instrumentgvere 
pete criterion measures, Third, we noticed the rela- © 


s. Tn Eei, the dexign may be stated in the null form: 
Bll be no, significant differences in outcome with respect to 
ives between students whose class meetings are 


ethod 1 and those students who are taught by Method 
hod 3. 


THE TEACHING PROCEDURES hi 


fellows each taught three sections—one by each 


ght by each of the three methods. In addition to the 
ure, all students were given the same assignments, used 
book, and were provided with the same opportunity 
ational movies and take sample tests, as outlined in the 
lence, the students were given a common core of lectures, 


nts. The teaching method was varied only with r 

rocedures used in the two section meetings per week. ~ i 

cit announcement Was made to the students that an ex- 
progress, 

week before classes actually started the following 

asissued and discussed by the eight instructors who 

twenty-four sections of the elementary course: 

iable in the experiment for the spring semester 

ie. Three types of teaching technique would be » 


. 2 
ns taught by the course administrator were not used in the = 


D 


entially effective. In addition, we pecu ' 
2 eal in method we were hotly arguing al in 


methods. Thus eight of the twenty-four sec- ‘ah 


demonstrations, and text and supplementary read. A 
SA 


ih 


196 The Journal of Educational Psychology 


compared: Drill—recitation; group—discussion; and. study—tu- 
torial. This variable was chosen rather than class size because 
experimental evidence indicates that variatidh in size produces 
negligible variation in performance, unless technique is also varied, 
These three techniques represent different degrees of directed vs. 
non-“jrected and group vs. individual work. 

1) Recitation—Drill—This method will be instructor directed, 
instructor paced, instructor dominated. A quiz will be given at each 
class meeting. The instructor will bring a list of prepared questions 
to class. Students will be called upon to answer and will be marked 
ieee stan. Each day the instructor will present a short 
lectixe-drill on the fundamentals. E 

2) Group—Discussion.—The discussion section is to be regarded 
as a supplementary tool used in conjunction with the text, the 
lectures, and the readings. It operates on the assumption that 
merely telling the student or having him read is insufficient. 

Students must formulate and discover the relationships them- 

selves. Instead of drilling facts, it attempts to clear up misconcep- 

tions and integrate the concepts of the course. Š 

The discussion group creates an atmosphere in which the student 
feels to expose his misconceptions, so they can’ be corrected; 
he is encouraged to bring in the problems which have troubled him 
in the/assignment; he is stimulated to utilize the basic concepts of 

_ the course to interpret incidents which occur in his day-to-day 
living. An attempt should be made to establish the sense that the 
class works together; each person learns from the other as well as 
from the instructor. Being a member of a group, he is carried along 
by the group’s enthusiasm and interest. The student should be 
given to feel that only by actually participating in the discussion 
will he, himself, gain a usable understanding of the content of the 

course. k 

Functions of the Ñistructor: —At various times the instructor will 
serve in the following capacities: Chairman: presiding, deciding 
order of speakers, etc.; Summarizer: at appropriate times through- 

* out and at the end of each class meeting, he should summarize the 
ideas discussed. Sometimes this summarizing ig vi expe 
itself as a resolution of a conflict among diseussants. Each Sa 
meeting should end with a sense of having actually covered groun! : 
» Discussion Stimulator: on needed occasions the instructor will as 
yeh a ae read case-histories, perform demonstrations, 
3 a 
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ite fruitful controversy and increase the ‘reality’ of 
Informant: at times the instructor will supply data 
owledge, because of his background in psychology. 
Will the instructor lecture. If he desires to present mate- 
included in the readings or lecture, he should ask a number 
students to prepare a symposium on the topie, @ use 
student-participant device. 

Facilitating Devices.—In order to facilitate the discus- 
nosphere, the following techniques are suggested: Tech- 
0 reduce the restraints#(1) no grade should be given on 


aU shape, it may be possible to remove part of the tradi- 
lasstoom restraint; the students then can talk to each other 
to the instructor; (3) have the students introduce them- 
he first day; supply the class with a seating chart and 
fellow-classmates; use first or only last names (omitting 


spread participation: (1) encourage comments 
ticent students; quash ‘stars’ who monopolize; (2) after 


‘ask backward students to bring written ae 

A eta Sometimes have all the students answer a 
Mby writing out the answer during classtime; this 

oorer students a chance to participate. Have students 
‘facts and answers in their text, so that poorer ones can 
wers and learn to contribute, using text as prop; (8) essay 
Ae tests, once every other week, except when course 


ven. 

jon instructors might introduce their work by mention- 
iis experiments on group decision. 

tal—Study.—This method will emphasize an individual- 
ach with the student pacing his own work within the 
of the course. No quizzes will be given, but occasional 
papers will be required. The class work will be relatively 
sompared with the Drill—Recitation method. The 


i but will be available to the students for consultation 

Out the hour. The students will utilize the hour for study e 
onsultation. The ie u endeavor to help tT. 
; SW j 


ion itself; this may allow the student more ee aa 
his ignorance and misconceptions; (2) by arrangifig the 


a 
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to find the answer to his questions rather than answering them 
himself. Social conversation will be prohibited. This method is 
based on that reported favorably by Scheidentann (Sch. and Soc., 
1927, 25, 672-674.). 


* 
INSTRUCTOR VARIABILITY 


0 ahiccvaing teaching methods, one is always faced by the prob- 
lem of the interaction between teacher and method. We chose to 
assign each instructor to teach three sections—one by each of the 
three methods. To reduce differences in instructors’ abilities and 

ore Dah, pig for teaching the different methods, the entire group 
‘actively participated in the general planning for each teaching 
method. In addition, three special committees were appointed, one 
to represent each of the three methods. Each committee was 
responsible for seeing that its method was used effectively. Thus, 
the ‘recitation committee’ worked out quiz questions and other aids 
for the Recitation—Drill classes; the ‘discussion committee’ pro- 
vided lists of suggested discussion questions for each day’s Group— 
Discussion classes, and the ‘tutorial committee’ provided demon- 
strations and reference books to be taken to the Tutorial—Study 
classes. We attempted in this way to make sure that each method 
got a fair trial. k 

The time sequence of methods was varied from instructor to 
instructor so that no method would consistently be used for the 
instructors’ first sections to be taught during the week. All permu- 
tations of the order of methods were used. 


EQUATING OF GROUPS 


While we did not know what factors enter into students’ choice 
of sections, we suspected that we could not assume that our classes 
were equated in intelligence and other possibly relevant variables. 
During the first an second class periods of the initial week of p 
semester, basic sampling data were obtained and an Otis Test 0 
Mental Ability (Higher Form A) was administered to each section. 

To carry out the process of equating groups, summary sheets 
for each section were compiled. For all hours, with two exceptions, 
all teaching techniques were represented each hour. It would an 

© been inconvenient to attempt to change the time-schedule oe 

> students. Hence, students were shifted to equate sections Wi z 

each p Although it was not possible to obtain complete equ 
\ 2 


® 
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‘or all sections on all characteristics, we were 
udents of each section typical, in the sense that 
iable number of women, Veteranai etc., and 


whether an experiment was being condueted in 
told that because no crucial experiments on teach- 
done, various methods were always being tried out by 


arily on the basis of scores on objective achievement 
e course-wide objective tests were given. The first took 
id of the initial four weeks of the course. A mid-term 
as given after eight weeks of the course, and a final 
| at the completion of the course. Coverage in these 


J 


4 Pine choice aip of the diet -answer type 


used for constructing the examinations was as fol- 
eyer was placed in charge of the construction of the 
costs, detailed tables of results and all measures used ý 
nt are included i in an appendix which has been deposited 
ican Documentation Institute. Order Document No. 4048 
locumentation Institute, 1719 N. St., N. W., Washington Ee 
$2.00 for 35 mm. microfilm or $3.75 for 6x 8" p copies. 
° 
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examinations. The other teaching fellows submitted questions and 
suggestions on the respective assignments. On the basis of the 
course objectives and the material covered, Me¥er drew up a table 
of specifications for the examination to be given, selected items 
submitted, and wrote new items to fit the table. This preliminary 
editi of the list was submitted at a meeting of all teaching fellows, 
Q and items were altered or eliminated. The revised examination was 
then further edited by Meyer, and the final edition constructed. 

Since one of the objectives of the course was that the students 
should be able to apply to life-situations the principles they had 

Sy, many of the items were ‘application’ questions. The ques- 
tion then arose as to whether such questions were*actually measur- 
ing the intelligence of the students more than their knowledge of 
psychology. This hypothesis was tested by dividing the final ex- 
amination into two parts, consisting of fifty questions testing the 
memory of facts and fifty questions of the application type. The 
scores of these parts were correlated with the students’ scores on 
the Otis examination, which was used as a measure of intelligence. 
The hypothesis was sustained to some extent. The correlation be- 
tween Otis and Factual question scores was only .17, while the 
correlation of Otis and Application question scores was .41. The 
t-test of the difference between the correlations is 1.6—significant 
at the .05 level. ; 

USAFI Examination.—In addition to the tests and final exami- 
nation, the United States Army Forces Institute Examination in 
elementary psychology was used as a measure of achievement. In 
1946—47 this test was one of the few standardized achievement 
measures in general psychology. 

We had also hoped to use essay tests as measures, and spent 4 
good deal of time attempting to develop reliable scoring methods. 
However, we were unable to raise the scorer-reliability above 60 
on the kind of broad questions which seemed necessary to us if we 
were to develop measures of a more than the achievement 
sampled by our objective tests. Hence, essay tests were not included 
in our battery of measures. 

*  Misconceptions.—One of the objectives of the course was to t 
» in such a way that the student’s knowledge of psychology a 
carried over to his everyday thinking. If this were accomplish A 
» common misconceptions about human behavior should be effecte! 
r y D 
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ptions submitted by the teaching fellows and from 
e's list (7). Part 1 consisted of eleven multiple-choice items. 
K isted of seventeen true-false items, the conventional 
for misconception items. Part 3 consisted of ten proverbs. Q 
ements were common stereotypes meeting acceptance 
y conversation, which from a psychological viewpoint 
not completely accurate. The students were asked to decide 
the proverb contained more truth or falsehood. Thee 
y of this test was .73 (estimated for the whole test). 
ptions of parenthood tests—Duvall (2) has classified re- 
of four hundred and thirty-three Chicago mothers of differ- 
levels to the two questions: ‘What are five thingsa good 
oes?” and “What are five things a good child does?” into 
gories—‘traditional’ and ‘developmental’. The traditional 
are characterized by a desire to make children conform 
id conception of the réles of mother and child. The develop- 
responses are characterized by respect for the child as a 
and by expectations of mother and child in terms of growth 
elopment rather than as specific behavioral conformities. 
in general psychology might be expected to produce a 
levelopmental conception of the rôles of mother and child. 
hypothesis a test was created by McKeachie using re- 
listed by Duvall as typical of traditional and develop- 
mental conceptions. Split-half reliability of the test was .80, esti- 
for the whole test. 
ntific and analytic attitude toward human behavior—McCand- 
6) test of analytic attitude toward human behavior was our 
re in this area. 
Measure our objective of creating a favorable attitude toward 
ology an eight-item Likelittype scale was constructed. The 
reliability of the test was .75. As another indication of the 
of the students toward psychology, students were asked 


on a list of those offered to undergraduate non-majors. AM 
ly significant measure of the student estimate of value of 
entary psychology course is the number who decide too 
9 

tes 


the advanced courses in psychology which they would like,* , 
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TABLE 1.— MEAN Scorn Summary or RESULTS 


A GA ial. | CR for Differ- 
Score Score Bieta 
9 Final, Zxamination 97.7 | 96.5 | 95.8 2.7 
USAFI 57.2 57.4 57.3 s 
Misconceptions 49.6 49.3 48.9 h: 
Parenthood 8.9 8.6 8.8 s 
Analytic Approach 7.6 7.6 7.8 y 
Attitude Toward Psychology 21.3 21.9 20.8 1.8 
-—— _ Jer of People Planning to Con- | 19 7 11 2.4 
centrate © 
Number of Advanced Courses Stu- | 4.3 3.9 3.6 2.9 
dent wants to take 
* Less than 1.0. 


concentrate in psychology while enrolled in the course. Students 
were asked at the beginning and end of the course in what subject 
they intended to concentrate. 


RESULTS IN AREA OF ACADEMIC ACHIEVEMENT 


The traditional method of measuring course outcome is a final 
examination. One of the students’ chief goals in a course is to pass 
the examination. As indicated in Table 1, with respect, to this con- 
ventional criterion our three methods ranked in the order: Reci- 
tation—Drill; Group—Discussion; Tutorial—Study. It should be 
noted that the absolute difference between the means of Recitation 
and Tutorial method is only 1.9 score points, and, although statisti- 
cally significant, it is so small as to be of dubious social significance. 

Not only are the differences between instructuors not statistically 
significant, but there was no significant interaction between in- 
structor and method. These results bring small comfort to those 
who look upon teaching as an art dependent upon the personality 
of the instructor. 

What was the relative effectiveness of each method for different 
degrees of intelligence within the range of this class? Measured by 

Scores on the final examination, the recitation method was found 
to,be more effective for all levels of intelligence in our group. This 
>was contrary to our general expectation that the recitation method 
when) Sa for poor students, but not for bright ones. 
b } 
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ted States Armed Forces Institute Examination in ele- 
sychology (Form A) was given to about two-thirds of 
dents enrolléd. All of the students in designated sections took 
so that there was no selective factor. Students were in- 
d that the test would not count in their grades, and they need 
nake special preparation for the test. They were urgedyto do 
best, but inasmuch as they knew their score would not count G 
ade, many students were probably not as highly motivated 
t had been their final examination. In spite of this lack of 
vation, a mean score for all three groups was about the 70th 

le on the norms given for the test. Differences P oer 
for groups taught by different methods were not significant. 
-The ossibility that one method stimulated or forced students to 
do mo work outside of class than the other methods used was 
ered. The students were asked during the fourteenth week 
ate, anonymously, the total time they had spent on psy- 
hology outside of class in a two-week period when there had been 
10 ¢0 e-wide tests. Results showed negligible differences between 

methods. This measure may be inappropriate, in that a 
teaching method may stimulate a student)to more study, 
a poor one may so confuse him that he is forced to spend more 

study. 


UTCOME IN THE APPLICATION OF PSYCHOLOGY TO 
EVERYDAY LIVING 


Horrocks (4) we find nothing in our results to encourage 
that one of our methods is superior in teaching students to 
psychology to life. There were no significant differences be- 
ethod on either the misconceptions test or the parenthood 
i However, like others who have constructed misconceptions 
s, we found that college students accept a number of common 
conceptions. Every misconception in our test, was accepted as 
at least some students. And this after the elementary semes- 
urse! 

There were no significant differences between methods measured 


Such negative results may be due to insensitive measures of ineffective 
les. A pre-course post-course comparison of scores on these measufes ‘a 
btained later. There were no significant gains on either the misconcep- 


or the analytic thinking tests. 3 © 4 
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Results of our scalé of attitude-toward-psychology showed stu- 
dents in the group—discussion sections to possess slightly more 
favorable attitudes toward psychology than students in tutorial— 
study sections (P = <.03), with students in recitation not signifi- 
cantly different from either of the others. However, most students 
in al*sections had distinctly positive attitudes toward psychology 
as measured by our scale. 

Students being taught by different methods differed in the num- 
ber of advanced psychology courses which they wished to take, with 
Recitation—Drill again leading. The critical ratio for the differ- 


Œ= Hof the means of the recitation and tutorial sections is 2.9, 


which is statistically significant. A follow-up study three years 
later indicated that students in recitation sections actually did 
take more advanced psychology courses than students in other 
sections. Note that in the class as a whole, regardless of method, the 
average student would like to take four more psychology courses. 
Frankly, we were somewhat surprised by this interest in further 
course work in psychology. 

While the numbers involved are small, a significantly larger 
number of students in recitation sections chose to concentrate in 
psychology than students in the other two types of sections. 

As another measure of the value attached to the course we had 
hoped to use the number of cancelled registrations. However, ex- 
clusive of those who dropped the course during the first week, i. e., 
«before the different techniques of instruction were introduced, only 
seven students dropped out of the course. Three of these students 
were in discussion sections and two each in recitation and tutorial 
sections. Contrary to the prophecies of some of the local Jeremiahs, 
there was no mass exodus from any method. 


ATTITUDES OF STUDENTS AND INSTRUCTORS TOWARD 
TEACHING METHODS 


An indication of the students’ attitude toward the method of 
instruction used is the shift in their preferences in the beginning of 
the experiment to their preference at the end of the course. At 

_ the beginning of the semester the recitation and discussion methods 


à were approximately equally preferred and the tutorial somewhat 


less so. Students in recitation classes showed a statistically signifi- 


> cant gain in their preference for recitation, while students in dis- 
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tutorial sections showed no significant changesin prefer- 


the relationship between the student’s preference for a 
and his success in learning by that method, as measured by 
content examination? It will be remembered that students 
were matched on the basis of grade-point averaggand 
within each method. Using scores on the final exami- @ 
a criterion, we found no significant differences between 
who initially preferred the method used and those who 
ranked as least preferred the method to which they were 


subjected. 

ttitudes of the instructors toward the three methort of 
were assessed at the beginning of the semester and again 
end of the sixteenth week of the semester, before any of the 
rs had knowledge of final results. At the beginning of the 
er, there was a slight preference for the group discussion 
with recitation-drill and study-tutorial methods preferred 
t equally, By the end of the semester preference for the reci- 
om method had increased to approximately equal that for dis- 
while preference for tutorial had dropped appreciably. No 
ing fellow ranked the tutorial method higher at the end of the 
ent than he had in the beginning. Tt is interesting to note 
hile some instructors still preferred teaching by the discus- 
i method and expected results to justify their preference at the 
the semester, their preference was not borne out by the_ 
s. Yet most of our teaching is based upon what methods 
prefer. 


DISCUSSION 


‘and large, we found no differences between the three teaching 
ds from the point of view of educational outcome. The few 
cal differences, in general, favored the recitation-drill 
hod. The results of the experiment, of course, were contrary 
original expectations. They constitute a clear confirmation 
general conclusion derived from experiments on instructional 
lures since the early 20’s. As Good recently put it, after , 
g the literature in this area, “The complexity of the tech- g 
learning process is such that attempts to establish the relatiye 
of a ‘general method’ of teaching are likely to prove incon- æ 
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clusive” (8, p. 275). Our results are thus in line with other investi- 
gations, despite the fact that our teaching methods seem to be 
much more extreme in their differences than had been the case in 
earlier experiments. 
One of the criticisms which is commonly made of the earlier 
expenjments on instructional procedures is the limitation of the 
) appraisal instruments, inasmuch as they usually measured only 
memory-knowledge outcomes of the particular course. Our experi- 
ment included a number of measures of attitude. Yet we obtained 
few differences among our groups with respect to terminal attitudes 
aM course. Perhaps the ‘general methods’ comparison is also 
g to prove unfruitful even when a broader set of educational 
outcomes are evaluated. 

Our experiment is inadequate in two important ways. Little work 
was done using delayed measures, which might have turned up dif- 
ferences between our methods after six months or more had elapsed 
after the end of the course. Perhaps an even more significant inade- 
quacy of our experimental design rests in the fact that the instruc- 
tion was given by inexperienced teachers who were just at the 
beginning of their college teaching careers. Cogent to this limita- 
tion is Peters’ (6) experience that as his high-school teachers con- 
tinued their work with a new technique into the second year, the 
difference between his traditional and experimental groups became 
stronger, as exhibited on a test of information. j 

Perhaps the most important factor which tends to eliminate dif- 

“ferences in comparing different teaching methods is the fact that 
thé experimental manipulation of the instructional procedure af- 
fected such a small segment of the students’ learning process. The 
student comes to us from a long series of classroom experiences 
which have emphasized content acquisition. His three semester 
hours in the elementary psythology course are only three in the 120 
semester hours he needs for his graduation from a large university. 
Even within our course we gave common examinations to all of 
our students, regardless of the teaching methods used. They at- 
tended the same lecture each week. They all had opportunity to 
see the same movies. With well-matched student groups and a de- 
sign which held the personal characteristics of the instructors Con- 
stant, we obtained no differences among our students when we 

, subjected them to comparatively small differences in environment. 
D There 5 more difference between our teaching methods than 12 
ò > 
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‘experiments reported to date. We used a wider variety 
measures than had been used up to 1946. Yet, the 
our ifistructional procedures were too small, when 
with the other factors which remain constant, to greatly 
outcomes of the learning process. 

l believe it is of vital importance to gain a better ugder- 
g of the effects of differences in instructional procedures. 
fact that the classroom situation involves so many varia- 
that the interactions among these variables are consider- 
that more fundamental research is needed upon the 
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VALIDITY OF AN OBJECTIVE EXAMINATION 
FOR THE PLACEMENT OF FOREIGN 
STUDENTS IN ENGLISH COURSES 
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With the post-war influx of foreign students to American univer- 
sities the problem of adjusting the college curriculum to the needs 
of foreign students became a pressing one. The need to place for- 
eign students in English courses at levels appropriate to their 
mastery of the English language has been one of the major factors 
leading to revisions in the curricular offerings for foreign students. 
Tn addition, it forced the exploration for techniques by which such 
students could be placed in appropriate English courses. X 

A common-sense approach suggests that the student’s experi- 
ence with English in his native land would be a satisfactory basis 
for placement of foreign students in English courses at American 

* universities. Schueler reflects the experience of many college in- 
structors, placement advisors and administrators with the com- 
ment: “Tt has been found that one of the most unreliable indexes 
of a student’s knowledge is the number of years of formal English 
study that he had in his native land. It has become necessary, there- 
fore, to measure the student’s command of English before admit- 
ting him to the abena section in the program. [The Queens 
College English Language Institute for Instruction of Students of 
Heterogeneous Linguistic Background]” (3). 

The placement of foreign students in English courses became 
particularly acute at Columbia University which admits approxi- 
mately a thousand foreign students each year for undergraduate 

® and graduate study. The students, who come from almost every 
country of the world, vary in amount of education in general, in 
= quality of preparation in the English language, and in the lengths 
N ? 208 
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ey have lived in the United States before coming to 
ey Because of these wide differences, it became 
to provide English courses at levels appropriate to the 
y of English these students have. 


ropriate English courses. Two course sequences are pro- 
foreign students, one for written composition, grammar 
g, and the other for oral composition and conversation. 
n test is for placement of students in the writing se- 
the oral test for the speaking sequence. 

ent study reports on the relation between the written 
‘success of foreign students in courses in writing and 


ENGLISH EXAMINATION FOR FOREIGN STUDENTS 


parallel forms of an English Examination for Foreign Stu- 
were developed by the Institute of Psychological Research 
ers College, Columbia University. The current forms of — 
have been revised on the basis of extensive item analyses 
g thousands of students: test-retest, and test-alternate 
orrelations indicate that both forms are reliable. 
test, with its four multiple-choice subtests, requires sixty 
for administration. To minimize confusion in answering, 
‘students mark their answers in the test booklet by drawing. 
around the correct response. 
One and Two are a thirty-five item and a ten-item subtest, 
ively, measuring English usage. The items are based upon 
ial usage errors made by foreign students. Part Three is a sixty- 
i vocabulary test, and Part Four is a reading comprehension 
three brief passages each of which is followed by five four- 
estions. The total score (maximum 120 points) over all 
points is obtained for each student. A more detailed descrip- 
the English Examination for Foreign Students is given in a 
nt article (2). 


ENGLISH COURSES FOR FOREIGN STUDENTS 8 


ierarchy of courses in writing and reading for foreign stt- 
the School of General Studies of Columbia University 
© 
\ © 


© 


; ral, were developed as aids in the placement of stents 4 
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follows: 
Name Title o | Semes=) Points 
G.8. English eV1 Introductory English for For- 1 4 
9 eign Students 

G.S. English eV3 Intermediate Written Composi-| 2 4, each 

and eV4 tion, Grammar and Reading semester 
G.S. English V9 Advanced Written Composi-| 1 4 

tion, Grammar and Reading 

G.S. English A3 English Composition for For- | 1 4 


eign Students (equivalent to 
the freshman course in Eng- 
lish composition) 


iT 


COURSE PLACEMENT ON THE BASIS OF THE ENGLISH EXAMINATION 
FOR FOREIGN STUDENTS 


Cut-off scores were, established! for the placement of students 
into the five English courses. The primary criterion is the score on 
the English Examination for Foreign Students (EEFS). Advisors, 
however, in considering other information about the students, do 
not adhere to the established cut-off points. Frequently, indeed, 
students are placed in a course despite the fact that their EEFS 
scores are above or below the established cut-off points for that 
course. 

The cut-off points! established for eV1 Introductory was 0—40; 
for eV3 Intermediate 1, 41-55; for eV4 Intermediate 2, 56-70; for 
V9 Advanced, 71-90; and for A3 Freshman English, 91-120. 


THE PRESENT STUDY 


The present study is a report on the value of the English Ezami- 
nation for Foreign Students in predicting success in English courses, 
where success is defined in terms of the final grade assigned in the 
course. Final grades cover an eleven step range: A+, A, A—, B+, 
B, B-, C+, C, C—, D, F. 

For three hundred and seventy four students, who had taken the 
English Examination for Foreign Students, final grades were avail- 


ə able on completion of one of the five courses. Only the initial course 


ð Pn These cut-off points have been modified since the present study was 


initiated. 
ş S 
> } 
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gives the essential information. In addition, the final 
bution is reported for each course for those students 
scores were within the established range for the course 


range. 


CORRELATIONS BETWEEN EEFS SCORES AND 
FINAL GRADES IN ENGLISH COURSES 


product-moment correlations between EEFS scores 
s in English courses are given in Table 1 for the five 
These range from .31 to .65. 
obtained validity coefficients resemble those of other in- 
s for English speaking students. Huddleston reported 
correlations between four different objective and subjective 
sures of English writing competence and freshman Eng- 
des as ranging from .23 to .46, with the median correlation 
an objective English test and course grades a8 .34. In 
‘intensive study, Huddleston reported a correlation between 
tive English test and high-school English grades of .60. 
review of the literature on objective English test prediction 
English course grades, Huddleston indicated that the range 
y coefficients was from .11 to .73, with the median about 


obtained validity coefficients for the foreign students are 
upon restricted ranges. The EEFS scores for students in any 


(.—Enquisa EXAMINATION For ForsIGN STUDENTS: VALIDITY 
CIENTS BETWEEN Scores AND FINAL Grapes: BY ENGLISH 
Course 


vs | eae 
Course and Level No. of Students Coenen Curtailment of 


63 .38 -45 

76 -40 -60 

34 65 .87 
110 .31 -56 

91 45 76 © 
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course are not distributed over the entire range of EEFS scores. 
Such curtailment of range on one variable tends to reduce the ob- 
tained validity coefficient. To estimate the validity for the full test 
range the correlations were corrected for restriction of range on 
one variable. (4) The corrected validity coefficients are presented 
in Table 1. 

Thus, either on the basis of the obtained, or of the corrected, 
validity coefficients, the EEFS seems to result in predictions as 
satisfactory as those for objective tests of English for native Eng- 
lish-speaking students. ; 

owever, while the ability in English of the native English- 

speaking student may be considered stable by the advanced high- 
school or college level, that of the foreign student who has just 
begun his study of English is probably at its period of greatest 
change. The changing proficiency in English of foreign students 
during this period, and the vast individual differences in potential 
improvement (dependent upon intelligence, similarity between 
native language and Phglish, experiences in the United States with 
English-speaking people, etc.) would tend to lower the validity co- 
efficients between EEFS scores and final grades in English. Thus, 
despite such limitations, the validity coefficients between EEFS 
scores and final grades seem satisfactory. (1) 


GRADE DISTRIBUTION IN ENGLISH COURSES FOR STUDENTS SCORING 
ABOVE, WITHIN AND BELOW ESTABLISHED CUT-OFF 
SCORES ON THE EEFS 


The final grade distributions for students with EEFS scores 
within and outside the established range for each course are pre- 
sented in Table 2. 

In general, the final grade distribution for students within the 
established EEFS cut-off scores ranges from. A to F: the largest pro- 
portion receive B’s, and considerably fewer get A’s and C’s. D’s 
and F’s vary from none to nine per cent for the different courses. 
The exception is the first course: G.S. English eV], in which seventy- 
eight per cent of the students receive A’s and the others B’s. 

For students with BEFS scores above the established upper limit 
for the particular course to which they were assigned, the general 
tendency is for a majority of students to receive A’s with relatively 


, fèw receiving C’s. Only G.S. English eV3, the first intermediate 


course, 5 an exception to this pattern with twenty-three per cent of 
Y 
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EN ESTABLISHED CUT-OFF Score RANGES AND FINAL 
DE FOR SPECIFIED COURSES: For ALL SCORES 


blished cut-off scores for the course. 


s with placement test scores above the established cut- 
receiving grades of C or lower. This suggests that a revision 
cut-off points for this course may be needed.? 

For dents with EEFS scores below the established lower limit 
the particular course to which they were assigned, the general 
t is for the majority of students in each course to receive 
Cor lower. There are very few A’s received by this group, 
e per cent of B’s is usually lower than the per cent of C’s. 
grade distributions indicate the value of the English 
ation for Foreign Students in the placement of students at 
propriate level in English. The grade distribution for students 
BEFS scores within the established range is satisfactory; for 
with scores above the established range tends to be limited 
the upper end of the grade range; and for students with 
below the established range tends to be limited toward the 
end of the grade range. 


modification has been made. c ( 
\, © 


Final Grade Distribution in Per Cent 
No. of 
Students 
A B c Dpi Fi @ 
19 78 pas 
44 43 39 4 
36 31 47 6 
27 11 74 = 
13 Sp 31 
2 100 = i 
16 25 63 VE 
16 6 38 6 
12 50 42 ~ 
64 19 45 5 
34 15 38 12 
58 16 53 9 
33 3 36 12 
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SUMMARY AND CONCLUSIONS 
The validity of an English Examination for Foreign Students wax 
tested against the criterion of final grades in English courses for 
students assigned on the basis of scores on the English test. Foreign 
students entering Columbia University are required tò take-an 


D objedtive English examination on the basis of which they are placed 


in one of a hierarchy of five courses in written composition, gram- 
mar and reading. Final grade distributions were contrasted for 
students with placement test scores within and outside the estab- 
i range for the course, 

tained correlation between English Examination for Foreign 
Students scores and final grades range from .31 to .65 (corrected 
for range of ability, 45 to .87) validities compare very favorably 
with those of other investigators between scores on objective tests 
with final grades in English courses for native English-speaking 
students (1). 

The final grade distribution for students with placement test 
scores within the established range for the course tend to be dis- 
tributed over the grade range, with the largest per cent of students 
receiving B’s and fewer receiving A’s and C’s. For students with 
placement test scores above the established limits, final grades 
bunched at the upper end with most students receiving A’s; for 
those with placement test scores below the established limits, 
grades bunched at the lower end of the grade distribution with 
most students receiving C’s and a considerable number receiv- 


- ing F's. 


The data suggest that the English Examination for Foreign Stu 
dents is a useful and valid measure for placement of foreign students 
ee meee et enerred mastery of Ent- | 
lish. 
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HAND LOW SCHOLARSHIP 
COLLEGE STUDENTS 


F. BROWN, NORMAN ABELES and IRA ISCO 


The University of Texas 


of the literature concerned with the study of factors 
student success and failure in college discloses a variety 


have revealed many personality differences between good 
t students. Ticbout (10), in his three-year clinical study 
nts at Sarah Lawrence College, found great similarities in 
ioral patterns of girls whose scholastic records were lower 
ence test scores would predict. He concluded that his 
pointed to the existence of a behavior syndrome which might 
terized as follows: (A) A need to rely upon strong and im- 
motivations to begin studying; (B) A tendency to possess 
8 of a transitory, impermanent nature; (C) A tendency to 
ned by strong hedonistic principles; and (D) A deep-seated 


of the studies concerned with this problem of differences 


questionnaire method. Several of these reports (1, 2, 8, 4, 
$ 11) seem to substantiate the hypothesis that existing study f 


tudents. Brooks and Heston (1) in a study concerned with 

lity of items in a study habits inventory found that the 

stionnaire scores correlated .18 with cumulative grade averages | 
W? with ACE scores. Their sample was composed of two 

and seventy-six college students. Of the total seventy- 

on the questionnaire, twenty-cight items were igen to 
nt. 


(8) prepared a 100-item study habits questionnaire wich. 
ered to one hundred and twenty-one students enrolled 
inning course in educational psychology. ‘The correlation ç 

“questionnaire scores and final examination grades was 
or the one hundred and twenty-one students. Other studies < 
215 to l J 
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high-scholarship and low-scholarship students have u r 


$ Or questionnaires do not adequately distinguish good from} 


tions (1, 3, 5, 7, 8, 9, 10, 12). Studies utilizing interwmas== 
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(2, 5, 8, 9, 11) utilizing the questionnaire method for ascertaining 
the study habits of high- and low-scholarship students have re- 
ported similar results. a 
An analysis of the items on the various questionnaires seems to 
indicate that the items were concerned mostly with specific study 
technigues rather than underlying motivational differences. The 
d studies to be presented here were designed to investigate motiva- 
tional differences in terms of conformance with academic require- 
ments and the willingness to take decisive actions. Three separate 
studies, each concerned with a different phase of this problem, will 
S&=giscussed. We shall name them as follows: The first, or Prelimi- 
nary Study; the second, or Student Attitude Questionnaire; and the 
third, or Classroom Study. Each of these revolves around the at- 
tendance of students at these studies. In the case of the Preliminary 
Study a sample of college students was asked to participate in a 
graduate student’s research project. In the Student Attitude Ques- 
tionnaire, a sample of college students was asked to attend sessions 
at which a questionnaire was to be administered under the auspices 
of Cornell University. The Classroom Study concerned a group of 
students taking a beginning course in educational psychology who 
were given the chance for additional help with their course work. 
This, then, briefly differentiates the three studies. 


METHOD AND RESULTS 


A) The Preliminary Study: Method—The sample consisted of 
one hundred and forty-three second-semester freshman students, 
both men and women enrolled in the College of Arts and Sciences 
at the University of Texas. Ninety-seven students from the Dean’s 
Honor List and forty-six students from the Scholastic Probation 

E List were selected chiefly because they had comparable scores on 
the ACE (1941 Edition). The one hundred and forty-three students 
selected represented all the students from each list whose ACE 
total scores fell within certain selected limits. The upper limit (139) 
was based upon the highest score made by a freshman on scholastic 
probation and the lower limit (65) was based upon the lowest score 

. made by a freshman on the Dean’s Honor List. The difference be- 
' tween the means of ACE scores made by the ninety-seven students 
on the Dean’s Honor List and the forty-six students on Scholastic 
» Probation was not significant. 
2 The g hundred and forty-three freshmen thus selected were 
D $ 
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esting that they report to one of the regular testing 
he Department of Psychology to codperate in a psycho- 
dy designed to investigate sex differences in study habits. 
nphasized that the students had been randomly selected 
e freshman class. They were told in the letter that the study 
about one hour and that they could come in any¢ime 
a designated three-day period to take the questionnaire. 
laced on the need for all a ects to appear so that the 


‘those who had not eae were sent a second letter re- 


ion were individually phoned and definite appointments 
de with these students to appear at a specified time, 


RESULTS: PRELIMINARY STUDY 


‘one was in the hospital and two ae if attendance was 
ory. When told ome they would not be forced to appear 


ng students on oaste probation, only one admitted hav- 
ved the two letters (i.e., when contacted over the phone) 
te the fact that none of the letters were returned by the Post 
. All of the Probation List students who failed to appear had 
mtacted three times by phone and had made three separate 


.—ArrmnpaNncs or HIGH- AND LOW-SCHOLARSHIP FRESHMEN AT 
PRELIMINARY STUDY 


Per Cent Per Cent tal 
Sees asean ‘ending per Cent 

r First T Secon 
dents Letter Letter after Phoning | Attending 


97 | 80% (78)* | 15% (14) | 2% (2)** | 97% (4) 
46 | 13% (6) |15% (7) | 22% (10) | 50% (23) 


n brackets refer to actual number of students attending. — p 


‘the Dean’s list having already taken the ine hit 
\ (6l 
(a) 


the importance of their coöperation. The remainiņgas== 


c 


C 
low percentage should be considered i in the light of ninety-five 
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than three-fourths of the Dean’s Honor List students appeared 
at the first session, while only a little more than one-eighth of the 
scholastic probation group appeared at this séssion. Furthermore, 
the largest number of the low-scholarship students to appear did so 
only after phone contact. 

B) The Student Aititude Questionnaire: Method.—This study took 
place in April, 1952, when the University of Texas was asked to 
codperate with ten other universities in a study of student attitudes 
concerning the draft, the Korean War, college life, vocational aspi- 
rations, religious ideas, government and politics, etc. Accordingly, 


sA alphabetical roster of all undergraduate male students at the 


ə 


University of Texas was prepared and every seventh student was 
sent a letter signed by the President of the University. This letter 
asked that the student attend any one of the twelve sessions 
scheduled during a specified two-day period. The student was in- 
formed of the nature of the questionnaire and also that he had been 
randomly selected. Codperation was strongly urged as well as the 
fact that other universities were participating in this project so 
that the results would reflect upon his school. In general the letter 
was so worded as to make the student feel privileged to participate, 
rather than making this seem like a distasteful task that had to 
be done. 

After the first session the procedure was similar to that used in 
the previous design. A second, more strongly worded, letter was 
sent which ‘ordered’ the student ‘to report’ to any one of four speci- 
fied testing sessions. The students who had not appeared at the 
second session were individually contacted at least once over the 
phone and appointments were made with them to attend special 
testing sessions. A point was made to talk with the individual stu- 
dent concerned in addition to leaving messages with his roommate, 
landlady or housemother. This study, then, comprises a total of 
three hundred and sixty-seven students; namely, those who at- 
tended the first sessions (two hundred and twenty) and those who 
were contacted by individual phone calls, whether they eventually 
attended one of the sessions or not (one hundred and forty-seven). 
For the purpose of comparison in this study, the students who 
attended one of the session in response to the second letter, as well 
as those who in one way or another could not be contacted (.¢., 


» Moved, drafted, sick) were eliminated. 
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ting of the categories may simplify the situation: 
‘Number 


9 
of letters sent to original sample* 660 
ttending first scheduled sessions (first 220 Group A 


ding second scheduled session (second 243 Group, B 
ing third scheduled session (phone 81 Group C 


cted (phoned) but not appearing 66 Group D 
nated from study (withdrew, ill, etc.) 53 Group X 


rdo attendance was available in that each questionnaire aeg 
an attendance slip. Upon completing the questionnaire a student 
name on the slip and turned it in separately. By noting the session 
a record was thus available while the anonymity of the students’ 

as preserved. 


to recapitulate, a comparison between groups A and CD 
) was carried out in terms of grade-point averages. These 
ted on the basis of A = 4, B = 8, C = 2, D = 1 and 
information was taken directly from the records in the 


_ RESULTS: STUDENT ATTITUDE QUESTIONNAIRE 


findings of this study may be found in Table 2. A chi-square 
‘was carried out and indicated a difference between Group 
Groups C and D which was significant at the .001 level 
8, d.f. = 3). We may note, therefore, that the students 
g promptly after receiving their first letter had signifi- 
er grade averages than those who had to receive two 


GRADE POINT AVERAGES or STUDENTS APPEARING PROMPTLY 
US LATE APPEARING AND NON-APPEARING STUDENTS 


Number of Students | Number of Students Total 
Group A Groups C and D 
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letters and at least one phone call. If a division is made at a cutting 
point of above and below a 2.00 point grade average (roughly a C 
average) the following breakdown results: ` 


Grade-point Average Group A Groups C and D Total 
P200 and above 159 82 241 
Below 2.00 61 65 126 
Total 220 147 367 


"Phe chi-square analysis is significant at the one per cent level of 


1) 


confidence (X? = 9.99, df. = 1). The distribution of frequencies 
reveals that, for the group considered, the higher scholarship stu- 
dents are more willing to at least comply with requests from the 
university authorities than low-grade-point students. 

C) The Classroom Study: Method —The sample of this study was 
composed of students in two sections of an elementary educational 
psychology course and consisted of eighty male and female students. 
They were told at the beginning of the term that one of their three 
examinations that semester would be taken directly from the true- 
false and multiple-choice questions found in their workbook. Three 
weeks before the examination the students were told that they 
could check their workbook answers with a set of answer keys 
constructed by the instructor. They were thus given the chance to 
get the correct answers to the five hundred and twenty-three ques- 
tions in the workbook. The test, was to consist of one hundred of 
the five hundred and twenty-three questions in the workbook. The 
instructor stipulated that they could come any time during a two- 
week period to look at the keys provided they had completed their 
workbook exercises. The instructor frankly admitted that the stu- 
dents could pretend they had answered the workbook questions by 
haphazardly filling in pluses and minuses and numbers although 
he emphasized that this would not help them study for the exami- 
nation. He also admitted that some students could copy the ‘key’ 
answers from another student who had done the work and looked 
at the keys. Furthermore, it was not compulsory to look at the in- 
structor’s key, for this was to help the student prepare for the work- 
hook examination, and, therefore, it was optional with the student, 


® whether he wanted to appear or not. The instructor kept track of 
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T SCORES oF STUDENTS USING THE SCORING KEY as 
MPARED TO Tuose Nor Usine Key 


Number of Students | Number of Students ae 
Using Key Not Using Key aoe 
25 6 31 
11 8 4 19 % 
9 18 27 
45 32 77 


RESULTS: CLASSROOM STUDY 


ts are shown in Table 3. We note that of the seventy- 
udents taking the test, forty-five made use of the key while 
did not do so officially—that is they did not come and 
it for the key. The extent that those not using the key 
looked at someone else’s corrected exercises is a matter 


aring the two groups, a chi-square analysis was made 
and the difference between the two groups was found to be 
the .01 level (X? = 13.301, d.f. = 2). The results are 
fairly obvious by inspection. There were twice as many 
scoring below 80 who did not make use of the key as those 
And of those who scored above 90 there were more than 
es as many who used the key as those who did not. 


DISCUSSION: THE THREE STUDIES COMBINED 


are the implications of the findings of the three studies 
Tt has been known for a long time that intelligence in | 
the only factor decisive for high or low scholarship. 
n, are some of the characteristics of the poor student? 
, there seems to be what we shall call here ‘activity delay’, 
a lack of decisiveness of action, a tendency to procrastinate, 
an unwillingness to conform to academic (and possibly 
quirements, routine and regulations. Whether this is 
ck of interest, or whether there is overt interest (as is 
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poen by students) which is not backed up by the activi- « 
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ties to accompany this interest, only further research will tell. This 
study seems to indicate that ‘activity delay’ carries on beyond the 
, classroom. Certainly most students express interest in having their 
attitudes ‘tested’. Particularly when a whole undergraduate school 
is sampled it seems especially significant that the poorer students 
either failed to attend or did not attend until they had received two 
letters and one phone call. It might be argued that these students 
are attempting to keep up with their work and thus do not have 
time to attend ‘attitude tests’. Yet, this is not consistent with the 4 
findings in the classroom. Even when given a chance the poorer 
= ga does not take advantage of study aids. The results of the * 
‘| three studies cited would emphasize that the student’s attitude 
j | towards academic life may be as important (or even more so) than 
W | specific study habits, study aids, tutorial possibilities or native in- 
\ telligence. Further studies are needed in order to ascertain whether 
` these factors carry on into other areas of student life, such as par- 
ticipation in social and athletic activities. Related to the above is 
the question of what was the underlying motivation in attending 
college in the first place. One aspect of this process; namely, pres- 
tige in terms of membership in select groups, is now being studied 
by the present authors. The question can then properly be raised 
as to the effect of activity delay on social group activities. | 
SUMMARY l 

A series of three studies were reported concerned with the moti- | 
vational differences between high-and low-scholarship students in 
college. The following findings were postulated: 

1) The poor college student is characterized by ‘activity delay’, 
i.e., a lack of decisiveness of action, a tendency to procrastinate and 
perhaps an unwillingness to conform to academic requirements, 

| routine and regulations. 

2) This activity delay is not limited to the classroom only but i 
exhibits itself in regard to activities usually regarded as outside f 
the classroom sphere such as voluntary;participation in research 
studies in psychology and OME projects such as attitude. 
surveys. © 

3) This study pointed toward the assumption that the poor- 
scholarship student does not necessarily score lower on psy' chologi- 

p @al tests designed to measure intelligence, but that very often 
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4 
and motivation are primary contributors towards 


=} AA 5 

r further raised the question whether activity delay 
other aspects of student life and indicated that 
progress to study this area. 
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THE PROBLEM-ORIENTED APPROACH TO 
TEACHING PSYCHOLOGY 


W. J. McKEACHIE and WESLEY HILER 


A t boi University of Michigan 
è 


How can students, comprehend’ and retain more of their text- 


book material? Many students who take a course involving con- 
cepts unfamiliar to them have considerable difficulty in gaining a 


=m_woncrete understanding of the material presented in their textbook, 


They read along passively and feel that they understand what they 
are reading but when examined on it, they discover how little 
they have mastered. We cannot attribute this difficulty merely to 
poor study habits, for many of the students who do poorly in a new 


_ subject are doing well in other fields with which they are more 


Ø. 


2 


familiar. 

We feel that the basic problem in these cases is the lack of an 
adequate frame of reference with which to organize and assimilate 
the various concepts, principles, and facts which are found in the 
textbook. Learning is easiest when it consists of filling the gaps in a 
“generalized knowledge of the whole.”! Every subject matter, be 
it science, literature, or the arts, is an organized body of knowledge;_ 
not a mere array of isolated facts; hence a knowledge of this sub- 
ject matter should be an organized structure within the student's 
mind. Expectations, questions, and problems are intrinsic to all 
such organized structures. These represent the gaps. Material 
which answers questions implicit or inherent in an organized know- 
ledge of a subject matter is learned with little effort. This is in con- 
trast to the extremely laborious and,energy consuming process of 
rote memorizing. _ 

On the basis of this theory, how can we help those who have no 
frame of reference which can be used to facilitate learning of new 
material, and who do not even know how to distinguish the essen- 
tial points from illustrative details? If we are not to resort to rote 
memory and all its disadvantages, we must artificially construct & 
‘demand structure’ in the student’s mind which the facts to be 
learned will satisfy. This can be done by proposing a problem or 

1 This conception of learning is as old as Herbart, but has been further 
rai 4 by tk» Gestalt psychologists. 
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question for which the person must actively seek the an- 
ably the problem should be meaningful to the individ- 
capable of arousing Ñ genuine interest on his part. If we 
our students to become ‘problem-ofiented’ towards their 
matter, they will learn it more easily. 
ason has reported the use of a work sheet system which 
the ‘intelligent question’ method in tea¢hing elementary 1 
and reports great success (3). Hegbelieves that the es- 
ng to be learned in psychology is the psychologist’s par- 
‘way of dealing with problems. Students must have an 
ity to gain practice in the use of psychological conceptsamcs= 
ninations must test this ability. “The students should 
kind of psychological processes will be valued and cred- 
the examinations, and they should be given every possible 
ty and aid in deliberately cultivating these psychological 
in their work for the course” (3, p. 363). 
3 procedure is recommended by Cason: “The instructor 
out the kind of questions that he desires for each topic. The 
to these questions include everything the students are 
to learn from the lectures, reading, class discussion and from 
all other sources. The one hundred or one hundred and 
Uestions on the subject matter of the whole course are given 
students in advance” (3, p. 367). In this fashion the student 
ow what he needs to know while he is reading the text or 
to a lecture. The questions used on the examinations are 
from this fist but are not identical in form. 
nately Cason presents no experimental evidence in sup- 
his method, so we are forced to generalize from the favor- 
ts obtained with work books in high-school courses re- 
by Anderson (1), Harrington and Lippert (6) and Hurd (7). 
imental study dealing with the use of work sheets is 
by Greene (4). He found on both immediate testing and 
after from one to four weeks that the results of ‘guided 
g’ (consisting in answering true-false questions on & work 
While reading) were very superior to either unguided reading 
® lecture. 
Motivational conditions in Greene’s experiment, however, may @ 
from those in an ordinary classroom situation. A, 
disadvantage is that Greene’s criterion was completely ø 
ated, for he merely used the examination sheets T: 
i e 
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sheets. This procedure is somewhat similar to that of improving 

one’s ‘intelligence’ by studying intelligence test manuals. Further- 

more, one cannot be certain that the over-all knowledge of the 

guided reading group was superior to the other groups. It might 

be that in learning more thoroughly the material to be tested on the 
r, exe nination the students learned less of the other important ma- 
X terial covered bythe text but not tested. 

Tn order to gain information concerning the value of study ques- 
tions for an elementary psychology course in an actual classroom 
situation using items on the criterion test which are different from 

=the study questions, the following Experiment 1 was conducted. 


Se EEN y SEE 


EXPERIMENT I 


The study was carried out with three hundred and sixty students 
of elementary psychology at the University of Michigan. These 
students all used the same textbook and attended the same lectures. 
They had been enrolled on the basis of student choice of hour in 
eleven discussion sections, conducted by advanced graduate stu- 
dents. These discussion sections met for one two-hour period. In 
addition the students spent two hours a week in large lecture sec- 
tions. 

Seven discussion sections were given a work sheet containing 
seventy-five short ‘answer questions and problems covering the 
textbook? material on sensation and perception paralleling the se- 
quence of topics in the text. Four of these sections, which we shall 
call Group A, were not required to work out the work sheet and 
turn it in. The other three sections, which we call Group B, were | 
required to work out and turn in the work sheet. The work sheets 
were then corrected and returned to the students. No students were 
discarded from Group B because of failure to do the work. As it 
turned out, this experimental group contains eighteen students who 
did not actually work out the work sheet. The remaining four sec- 
tions, Group C, did not receive a work sheet. 

Sections were assigned to these three groups in such a way that 
the three groups were as nearly equal as possible as measured by 
performance on the mid-term examination. Of the four instructors 
involved, three taught three sections each, and each of the three 
sections received a different treatment, while the fourth instructor 4 


2 ice used was Woodworth and Marquis’ Psychology- 
+] ńį 24 
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two sections which were assigned by chance to condi- 
adc. 

ar final examination (multiple-choice type) which had 
epared independently of the questions used on the work 

ed as a criterion for measuring the effectiveness of the 

eets, As it turned out only four questions on the examjna- 
wered the textbook material on perception and sensation 4 
nation questions were identical to those on the work 


student in the two experimental groups was matched with 
in the control group on the basis of his score on the finalgres= ” 
m as a whole.* This was done by matching as many 
ores as possible and then matching scores with a differ- 
8 in such a way that the total score for each experimental 
as identical to the total of the control group. Thus the 
f the groups to be compared were perfectly matched. Since 
‘could not be matched with any in the control group, we 
‘of 87 pairs in the matching of Group B with C and an N 
s in the matching of A with C. 

umber of errors each student made on the perception and 


ion questions was then tabulated and the difference between 


+ 
RESULTS OF EXPERIMENT 1 


‘mean and standard deviation of the distribution of differ- 
cores on the perception items was calculated, and the 
ifference between the pairs was tested for significance. The 
difference between the group which had been given the work sheet 
uired to do it and turn it in (Group A), and the control 
ved to be insignificant. However the difference between 
trol group and the group which had been given the work 
th instructions to work it out and turn it in (Group B) 
e highly significant. The control group made forty-two 
ore errors on perception and sensation than did experi- 
p B. See Table I. 


y tentatively conclude that under the circumstances of 
e 
nly four questions out of one hundred and twenty dealt with 
m and perception, it was not necessary to subtract the score on thes 5 
tro: the total score in matching the groups. Our procedure would tend 
the significance level, but only slightly. ie rf 
-6 
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TABLE I.—ERRORS oN Perception ITEMS on FINAL Examination 


Standard |p 
Mean Deviation of N P 
Differences 
Group A 1.063 f 
¿a matched with 1.16 111 -085 
Group C 1.252 
Group B 0.839 
matched with 98 87 .01 
Group C 1.195 


this study the use of work sheets was effective and that the group of 
students required to work them out was superior in the retention 
and comprehension of the textbook material as tested by final ex- 
amination to students who had not used work sheets. Our original 
hypothesis about the effectiveness of work sheets has been con- 
firmed. 


STUDENT OPINION ON THE USE OF WORK SHEETS 


In order to see what the students who had been given the work 
sheets felt about their value, a questionnaire was handed out to 
one of the sections which had filled out and turned in a twenty- 
question work sheet. 

Out of the twenty students (all of the members of the class) who 
answered and returned the questionnaire, eighteen expressed ap- 
proval of the general idea whereas two were opposed to it. 

Some typical comments made by the students were as follows: 

“I think they greatly aid study and help the student pick out the 
important points.” t 

“These sheets definitely help in giving a clearer understanding 
of the textbook.” 

“Work sheets are an excellent idea, if they are not to be cor- 
rected, since they indicate what material is to be concentrated 
upon. To know what parts of the text are important (which is often 
difficult to determine) would bring the whole course into a clearer 
and better perspective.” 

„ Thus in spite of the additional work required by the use of work 


» sheets, many students reacted favorably towards them. 
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EXPERIMENT 2 


conclusion ef Experiment 1, we still were left with two 
about the use of study questions. 
the study questions been effective simply because they 


d by the questions? g 
increased learning produced by study questions gained 
e of material not covered by the questions? 

wer these questions another experiment was carried out in 
dy time was controlled by giving study questions during 
riod. Students spent one class period studying the topje 
ving’ using thirteen study questions provided for them. 
group was instructed to spend the class period in study- 
me material, but was given no study questions. The ques- 
d on the work sheet were of the ‘short answer’ type, i.e., 
id be answered in one or two sentences. 

the last ten minutes of the’ class period, the students 
en a criterion test made up of ten multiple-choice items. 
on this criterion test were constructed to fall into 
sories: (1) those questions directly related to a study ques- 
ering the same material but different in structure), and 
uestions not based on the study questions. 


RESULTS OF EXPERIMENT 2 


sults of Experiment 2 showed that the students who had 
iven study questions did make higher scores than the control 
n the items directly related to the questions. This higher 
s not gained at the expense of areas not covered by the 


Tasim I1.—Errors on CRITERION Test 


Group Using 
stu inde Control Group 
aS ES 
M sD N M sD N P 


directly related to study| 1.9 | 1.2 | 95 | 2.5 | 1.0 85 | .01 


s. Total possible—5. e 
not based on study ques-| 1.8 | 1.15 1.8 | 1.27 
. Total possible—5. AES 
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questions, for scores on items not based on the study questions did 
not differ for the two groups. See Table II. ,, 


DISCUSSION AND CONCLUSIONS 


On the basis of the evidence at hand, we conclude that work 
shes are effective in improving performance on examinations. We 
have discussed the theoretical basis for this effectiveness in our 
introduction, but our experiments and the reports obtained from 
the students enable us to become more specific. 

Work sheets help the student in the process of selection. Faced 


SSeavith a mass of facts, he needs to know which facts or units of 


| 


material are most important. With a work sheet he can emphasize 
these facts in his study. Our evidence indicates that the selection 
of important facts does not interfere with the learning of material 
not selected for study questions. In addition, study questions can 
provide a means whereby the student can organize the vast array 
of facts and concepts into a unified framework which facilitates 
subsequent learning. The questions provide a framework into which 
the student can fit unfamiliar subject matter. By helping him find 
-the logical inter-relationships between facts and principles, they 
may aid him in categorizing and unitizing the subject matter. Our 
students reported that in addition to their selective function, the 
work sheets give a clearer understanding of the textbook and bring 
the whole course into perspective. j 

Our theory of the function of study questions possesses certain 
implications for the construction of such questions. 

1) Questions should be designed which require the student to see 
the implications of a fact, to relate it to other facts and to draw in- 
ferences. We would expect a question requiring only rote memory of 
a specific fact to be of little value in organization. 

2) No question should be asked which requires a more complete 
frame of reference to answer intelligently than is possessed by the 
beginner in a new field. The questions should be capable of being 
answered correctly on the basis of the material available to the 
student. We would guess that a too difficult question would be 
likely to cause the student to distort his cognitive organization of 
the material in order to get some sort of answer. 

3) No question should be asked which can be answered by & 
mere quotation from the book. The most effective type of ques- 
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bably one which presents a concrete situation or prob- 
the stugent to interpret or solve it on the basis of 
oncepts or principles. This sort of question, we believe, 
» student to look for relationships. 
oblem which has not been touched upon is the optimal 
r of questions to be used for any given amount or type of 
tter. If there are too few questions, not enough of the 
‘matter will be covered. If there are too many questions, the 
of ‘mental dazzle’ (9) will enter into the picture. If there are 
umber of questions and a great many new concepts, the 


at the optimal number of questions would vary with the 
f new concepts in the material and with the learner’s 


SUMMARY 


method was recommended for use in introductory courses. 
experiments were undertaken to obtain evidence of the 
ess of work sheets in an elementary psychology course. 
Tesults proved highly significant in favor of the group 
‘work sheets. 

rvey of student opinion indicated a favorable attitude 
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OW STUDENT-TEACHERS IDENTIFY 
RESPONSIBILITY IN CHILDREN 


DALE B. HARRIS DZ 


“Institute of Child Welfare, University of Minnesota s G 


do teachers define, behaviorally, responsibility and a lack 
onsibility in children? To supply an answer to this question, 
ndred and fifteen women education students were queried 
‘some four months’ experience in practice-teaching. The fakes 
paragraph was supplied to each student teacher: 
a part of a research project in the development of responsi- 
in children, will you assist us by describing two children with 
you are now acquainted? Among the children you are now 
g, select the one you consider the most responsible and the 
consider least responsible in his or her general relationships 
to the group or class, and to other children with whom he 
tes in play. For each child, write a description to show 
ein he is responsible or irresponsible. Describe the behavior 
has caused you to make your judgment. Just what does he 
which makes you name him as the most responsible or the least 
ible child you are now dealing with? Please emphasize be- 
incidents; general trait characteristics are useful but not as 
ul as specific instances of behavior which you can mention.” 
hen teachers were asked to define ‘responsible’ and ‘least re- 
e' children, no requirement as to sex of child was men- 
d, and, since teachers were unevenly distributed over pre- 
age groups and through the elementary grades, unequal 
s at different ages were obtained. Two hundred and twelve 
tions for ‘responsible children’ and one hundred and ninety 
inations for ‘least responsible’ children were collected and 
. Boys and girls appear in a ratio of about three boys to 
girls. Table I shows that practice-teachers, themselves females 
g in the elementary grades, name boys in greater numbers 
ls as being ‘least responsible’. Among the younger children, 
Consistently more often selected than boys for nomination ¢ 
ible’ children. The difference in proportions of the sexes 
s in the intermediate grades. It could be said, perhaps, « 
233 "e e 
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TABLE [I — NUMBERS or CHILDREN REPORTED BY STUDENT-TEACHERS As 
Mosr anp Least RESPONSIBLE, BY AGB 


Most Responsible Least Responsible Total 
Age Group Boys Girls Boys Girls 
N % 
Vier PU lie Ne ln | w | I 
+ ee 
23 | 8|20| 13| 32 | 16] 3 | 4/| 10 | 41 | 100 
4 | 3})u] 1] 4 | 10] 37 | 3 | 1 | 27] 100 
5 | 15] 24] 19| 31 | 22] 35 | 6 | 10 | 62 | 100 
6 | 14| 26] 17] 31 | 16] 29 | 8 | 15 | 55 | 100 
iis ahi 5|27| 18| 40 | 17| 38 | 5 | 27 | 45 | 100 
8 7|21| 10| 30} 9 | 27 | 7 | 2 | 33} 100 
9 oll 7| 2| 15| 43 | 4 | 1 | 35 | 100 
10 | 13] 30] 10} 2 | 16] 37 | 4| 9 | 43} 100 
n |10|36| 5] is | 1] 30 | 2] 7 | 28 | 100 
12 | 7\|27| 6] 23 | 12] 46] 1] 4 | 26 | 100 
= 13 | 3/27} 2/189 4] 36 | 2] 18 | 11 | 100 
Totals | 94 118 148 46 406 


responsibility can be achieved by either sex, but girls are consider- 
ably less likely to be called irresponsible. 

An attempt was made to read a number of protocols and to devise 
a series of categories, under which specific behaviors or behavior 
elements mentioned by the teachers could be subsumed. This pro- 
cedure did not give ‘reliable’ results; i.e., too much inconsistency 
of classification occurred. Protocols were then read one at a time 
and each separate characteristic mentioned for the responsible child 
was marked with red pencil and listed on a large sheet of paper. 
Where another teacher mentioned the same characteristic in the 
same or almost identical words, an additional tally was made. This 
procedure resulted in a great number of specific items, with rela- 
tively few multiple entries. The separate items (phrased in pre- 
cisely or almost precisely the same wording) were then grouped; 
these phrasings were collected into larger groupings by a team of 
judges according to agreement on manifest content. All the tallies 
relating to these groupings were summed and converted into a per- 
centage of the children in that age sample. Examination of these 
groupings showed certain rather gross dimensions of the behavior 
thus classified which could be named by the judges. For the charac- 
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attributed to most responsible children, these dimensions 
‘ormity, work habits, independence of action and motiva- 
itiative), social behavior and skill, and general personality 
icter qualities. A similar set of entries could describe all of 
ate groupings of descriptive items for the least responsible 
© has 

ample of the classification may clarify this procedure and, 
ite the kinds of entries tabulated: i 
ity —(a) Can be reasoned with; responds to suggestions; 
instructions, pays attention; listens. (b) Remembers what 
doesn’t need reminding; stays within bounds; seldom needs 
ne. Can be depended on; doesn’t interrupt, misbehave, run 
talk, ete; well-mannered. (c) Helps keep other children fn 
elps other children ‘keep in line’. (d) Conforms to specific 
involving self-help. 
onding entries for the least responsible group are: (a) 
observe rules and routines; not compliant; will not, take 
ns; ignores discipline. (b) Doesn’t follow directions; must 
ded; pays no attention. (c) Short attention span; distract- 

. 


tabulations were originally under such specific headings, 
ly later collected into broader categories, it was possible to 
e considerable reliability of judgment in classification. Two 
identifying and tallying the behavior elements for a small 
‘protocols achieved about ninety-five per cent agreement. 
nent in collecting these specific entries into broader cate- 
as done by consensus of a team of three. ’ 
too much significance can be attached to the trends reflected 
age breakdowns in the original tables. It was thought at 
‘a number of such trends might appear, and indeed some 
ppear which will be discussed. However, the tables were 
hy for the fact that no characteristics appear suddenly or 


ailed tables of results referred to in this paper have been deposited 
eument number 3974 with the ADI Auxiliary Publications Project, 
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disappear completely within the age range encompassed in this 
study. 2 
The responsible child is seen as a conforming child in a good 
many instances. Conforming to specific routines, and responding to 
suggestions are mentioned by teachers of pre-school groups, mainly, 
% and wflect the importance attached in early childhood education 
to self-help, and socialization by learning group routines and by 
responding to reason and suggestion. A similar form of behavior, 
noted among older as well as younger children, is described by ex- 
pressions like: remembers what to do, doesn’t need reminding, 
epeldom needs discipline, stays within bounds. The characteristic of 
helping keep other children in line, or helping children control their 
own behavior, is more often seen in the child between the ages of 
five and eight. In general, some aspect of conforming behavior, as 
illustrated by the phrases above, is mentioned consistently for one- 
fifth to one-third of the children in any age group. 

Work habits are mentioned in most descriptions of responsible 
children. Doing good, efficient work promptly, finishing work 
begun, asking for extra work or doing more than is requested as- 
gume major importance early in the primary years, being men* 
tioned for three-fifths of 6 and 7-year olds, and about ninety 
per cent of 8- to 11-year olds. Doing housekeeping chores and 
being a dependable messenger or room helper are mentioned in 
about one protocol in four, while putting away toys and materials _ 
appears much more frequently in the preschool and early primary 
years.sUndoubtedly, the characteristic of being helpful to teacher 
is an important factor in the child’s being considered responsible. 

Teachers recognize independence of action as an indicator of 
responsibility. The child who will initiate his own activities, work 
independently or not need supervision, is seen as responsible in 
one-fourth to one-third of the cases. The qualities of being inter- 
ested and highly motivated in work is more commonly mentioned 
after the primary years, possibly because the early enthusiasm of 
children for school is beginning to wane and the pupil who retains 
his enthusiasm is particularly appealing to the teacher. This charac- 
teristic is mentioned in about one protocol in five by ages ten and 
eleven. . k 

vAt a time when educators are devoting a great deal of attention 
ə to social behavior and skill, it is not surprising that social-minded- 
ness isqentioned in half or more of the protocols as an aspect of 
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ible behavior. Sharing toys and taking turns is more a 
again, of the preschool group. Contributing ideas to the 
ion, and being ready to discuss assumes increasing 
ance (from twenty-three per cent at four and five years to 
-three per cent at twelve and thirteen years) in the elemen- 
hool period. Being ‘very much a part of the group’gfair, 
port, ete. are mentioned in one protocol in every five or six. é 
g specific offices in class appears in one-third to two-thirds 
protocols. ni 
imber of general characteristics were rather frequently men- 
responsible children which have not been subsumed 
the headings already discussed. Although maturity of beha- 
be inferred from many of the earlier descriptions, the 
terms mature, poised, codperates, stable, or friendly ap- 
about one-fifth of the protocols. Occasionally the charac- 
tic of responsibility is associated with being intelligent or bright. 
infrequently such terms as neat, precise, perfectionistic, or 
jentious appear. These terms suggest that some children 
achieved a reputation of responsibility with their teachers 
ugh characteristics which mental hygienists sometimes cons 
dicative of emotional difficulty. Children may win their 
s approval through painstaking, meticulous work at the 
f other interests and activities. 
e this study did not attempt to investigate the inter-correla- 
: f specific characteristics, it is clear that several rather dif- 
ent types of behavior are viewed by teachers as characterizing 
ility in children. The child who conforms to routines and 
’s expectations, who does good work and who does it willing- 
beyond the stated requirements, is seen as responsible. 
ild who may be personally helpful to the teacher by house- 
ing chores in the room is also marked as responsible. Teachers, 
do recognize that independence of action and high level 
tion may also characterize the responsible child, as may 
social behavior which is characterized by readiness to dis- 
and contribute to group meetings and by election to class of- 
Probably the most frequently mentioned characteristics have 
© with conformity and the quality of the child’s work habits; © 
characteristics come next in frequency of mention. 
laracteristics attributed to least responsible children seem to 
Writer to make a clearer picture. Lack of compliange with 
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routines is mentioned in about half the protocols. Just as frequent 

is an indication of poor work habits or unwillingness to do school 

work. Failure to participate constructively in group activities or 

being an interfering factor in group activities is also associated with 

little responsibility (from twenty per cent to fifty per cent of the 
% protaxzols). 

Quite a variety of general personality and character qualities 
are used to describe the irresponsible child. Relatively few such 
general qualities were ascribed to the highly responsible child. Such 
terms as nervous, fidgets, highstrung, immature, unstable, cries, 

Sovhines, appear in from twenty to forty per cent of the protocols. 
Specific interpretations of behavior as ‘desires attention’, ‘a show- 
off’, ‘distracts others with his behavior’ are given for equal pro- 
portions of the children. Being careless, forgetful, scatter-brained, 
unpredictable, impulsive, or losing things mentioned not at all 
for youngest children, appears,in seventeen per cent of the protocols 
at ages four and five, and rises to thirty-three per cent of ten- and 
eleven-year olds. Being slow, tardy, or dawdling appears at all 
ages in between ten and twenty per cent of the protocols. Aggres- 
sive and destructive characteristics, toward children or adults, 
appear rather consistently in one-fifth of the cases, regardless of 
age, beyond the nursery group. Such characteristics as solitary, 
unaggressive, unsure of self, not a leader, not liked by others are 
mentioned in a scattering of protocols at almost every age. 

While the picture of the responsible child is generally that of a 
well-adjusted child, the picture of the less responsible child is even 
more clearly that of an emotionally and psychologically unadjusted 
child. He is unhappy, does not get along well with his classmates, 
with his work, and with life in general. 

Is responsibility then a matter of habits or attitudes which may 
be inculcated in the school program? Do teachers teach responsi- 
bility through the giving of tasks and the encouragement of work 
attitudes? This is possible but the present writer believes that 
responsibility is more likely a matter of the general level of the 
child’s psychological adjustment which reflects in his functioning 
as he approaches tasks. The happy child who has some assurance 

> of acceptance by the teacher and by others proves to be responsible. 
He shows enthusiasm for his work, follow-through on tasks. The 

ə unhappy, maladjusted child, perhaps because of his preoccupa- 
tion with his problems, has little inclination to follow-through 02 
wie i 
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sks. He is therefore identified as showing little responsi- 
hether giving such a child specific tasks and encouraging 
low through on them will develop a sense of responsibility, 
ated. If being given the task brings witlt it a sense of 
tion of status which the child needs, his performance, 
, will be improved. In many cases, however, this as- £ 
task or being given a ‘responsibility’ on which he 
e up does not relate to the child’s psychological dis- 
nce. He is given one additional assignment to which he cannot 
hal a concerted attack. Another in his series of failures is likely 
shalked up against him, thus further confirming the teacher's" 
f that the child shows very little responsibility. This is another 

le of the ‘vicious cirele’ effect in the reénforcement of learn- 


descriptions pose a problem of considerable significance 
al hygiene in education. They suggest that responsibility 
rrelate of good adjustment and can be achieved by whatever 
nprove the child’s general life adjustment and that responsi- 
probably not to be sought as a skill, habit or characteristic 


SUMMARY 

hundred and fifteen student-teachers supplied behavior 
ms of the most and least responsible children in their 
While there is some tendency for teachers to nominate more 
han boys as responsible, especially in the preschool and pri- 
groups, there is a marked tendency for teachers to nomi- 


ty through activities and require little supervision; they are 
reeful and socially oriented, participate well and help others. 
tesponsible children are not conforming or compliant; they 
help with work or ‘prodding’; they bother other children and 
erly group activities; they appear to their teachers as 
mature, nervous, irritable, careless, forgetful, aggressive, 
or self-centered. They present a clear picture of poor 
ty integration and adjustment. Responsibility appears to & 
ession of quality of adjustment to demands rather than 
i habit, or skill in itself. 
we y ý 
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FALSIFICATION OF INTEREST PATTERNS ON 
THE KUDER PREFERENCE RECORD 


EDWARD J. DURNALL, JR. 
Y e Nasson College, Springvale, Maine 


Various studies have been conducted to determine whether it 
is possible to falsify interests on the Kuder Preference Record. The 
general conclusion appears to be that it is quite simple to achieve 

=m high or a low score in a particular interest area (1). This fact has 
tended to limit the use of the Kuder Preference Record in those 
situations where a certain interest was zequired for employment or 
for admission to an institution or a particular program of instruc- 
tion. The possibility of falsifying an entire interest pattern, rather 
than one or two interest areas, has not been adequately explored. 

The method of this study was the administration of the Kuder 
Preference Record-Form BB to a group of students, using thé 
instructions furnished by the author. The inventory was then re- 
peated within one week, with specific instructions to the students 
to check the items as they believed a person with interest in a 
certain occupational field would check them. The occupational field 
selected was that of accountant-auditor, since Kuder has furnished 
norms for this group (2) and since the accountant-auditor profile 
score may be determined by means of Fisher’s discriminant func- 
tion (3). The Otis Quick-Scoring Mental Ability Test-Form C was 
administered to the group to give some measure of intelligence in 
relation to the ability to falsify. 

The group selected for this experiment was made up of two 
classes in personnel management at a four-year co-educational, 
liberal arts college. As the number of women involved was quite 
small, the analysis was limited to the scores of the seventy-mine 
men. 


RESULTS 

1) There is little doubt that it is possible to simulate entire inter- 
ə est patterns of certain occupational fields. Figure 1 presents ® 
graphic comparison of the mean interest profiles of the students on 
the first testing as compared with those of personnel managers as 
reported by Kuder (2), and Figure 2 presents a graphic comparison 
o f 240 
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— Profile of means of students on their first test compared with the 
Norms for Personnel Managers. 


mean interest profiles of the students on the second testing 

d with accountant-auditors as reported by Kuder (2). 
noted that the average true-interest profile of the group 

lar to that of personnel managers while, with the simu- 

, the average interest profile of the students was typical 
ntant-auditors. 

mean accountant-auditor score of the group on the first 

33.25. On the second or simulated test, the mean was 

| The critical ratio between these two scores was 19.14, leav- 
le possible doubt that the difference was significant. 

he students with high accountant-auditor scores on the first 

tion of the Kuder made slightly higher accountant- 

scores on the second administration than did the students 

y accountant-auditor scores. The mean accountant-auditer 
the upper quarter from the first administration was 105.63 * 
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Fia. 2.—Profile of mean of students on their simulated test compared 
with the Kuder Norms for Accountant Auditors. 


on the simulated testing; while that of the lower quarter from the 
first testing was 98.84 on the second. The critical ratio between 
these two scores was 1.09. À 

4) The correlation between the gain in the accountant-auditor 
score from the first administration to the second administration 
and the Otis IQ was .39, with a standard error of .09. A similar cor- 
relation (.31) was found to exist between the accountant-auditor 
scores on the simulated test and the Otis IQ scores. The multiple 
correlation coefficient between Otis IQ scores and true accountant- 
auditor scores on the one hand and simulated accountant-auditor 
scores on the other, was .50, with a standard error of .08. 


CONCLUSIONS 


It seems apparent from this study that it is entirely possible 
to falsify an entire interest pattern on the Kuder Preference Rec- 
ord. That the ability to do this is related to IQ is shown by RE 
on the Otis test. It is also related to the true interest patterns © 


o the students, since the student possessing a high Otis IQ and some 


6 D 


HERI 


D 


= Falsification of Interest Patterns 23 


ld to be simulated finds it quite simple to achieve 
| the profile for the occupational field in question. 
the Kuder Preference Record in situations which 
d a person for showing a certain interest pattern 
question. 
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THE RELATIONSHIP BETWEEN CLASS 
ABSENCES AND COLLEGE GRADES 


ALEXIS M. ANIKEEFF 
® Department of Psychology 
; 


Oklahoma A. and M. College 


For the purpose of developing a sound administrative policy 
toward college class attendance, an inquiry was initiated to deter- 

ine the nature of relationships which may exist between class 
absences and final grades of students enrolled in the School of 
Business and Industry. 


PROCEDURE 


At the conclusion of the fall semester, thirty-four faculty mem- 
bers in the School of Business and Industry submitted a roster of 
2654 students who were enrolleđ"in their classes. This roster in- 
dicated the number of class absences for each student, apd the 
student’s final letter grade in the course. Established numerical 
quality point equivalents of 3, 2, 1, 0, and —1 were substituted 
for letter grades A, B, C, D, and F, respectively. Students were 
then placed in categories according to the number of absences 
which each student accumulated during the semester in each of 
his courses. Arithmetic means of quality point distributions were 
next derived for each absence category. In this manner, data be- 
came available for plotting class absences on the Y axis and quality 
point averages on the X axis of scatter diagrams. Linearity of 
scatter diagrams suggested the computation of Pearsonian product- 
moment coefficients of correlation. 

Moreover, in order to visualize the obtained data from another 
perspective, students were grouped according to letter grades and 
the median number of absences for each letter-grade category was 
computed. This procedure represented more realistically the usual 
academic routine of assigning grades and recording number of 
class absences. In addition, the use of medians rather than arith- 
metic means, reduced the potency of extreme cases. As such, the 
data conceivably appeared in a more descriptive form. 
> Tnasmuch as the same instructor indicated both the letter g" ade 
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umber of absences for each student, an instructor’s 

to assign grades which are based in part upon the number 
accumulated by a student could lead to spuriously 
tions between grades and absences. As a partial meas- 

e extensiveness of such a practice, each instructor was 
according to the median number of absences foundén his œ 
‘The instructor was again ranked in terms of the arithmetic © 
of the quality points which he assigned as final grades to 
Spearman rho as illustrated by Guilford (2) was derived 

ibe the relationship between instructors ranked according 

of class absences and instructors ranked according tee== 
atic means of assigned quality points. Although the obtained 
tion may not indicate the specific extent to which each in- 
used number of absences as a criterion for determining 

lass grades, some measure of the over-all tendency for all 

ors to do so could be indicated. 


RESULTS 


points were found to vary inversely with number of ab- 
on all class levels, as Table 1 indicates. The obtained co- 
of correlation range from —.633 to —.962, and in all cases 
ant at either the .05 or .01 levels of confidence. By 
g the r obtained for the four-year breakdown, it is evident 
3 per cent of the class grade variance in this category can be 
ited to number of class absences and other related factors, 
nly 17 per centi#of the class grade variance can be attributed 
ted factors. Using the same procedure, the reader can 
the extent of variance attributed to class absences for 

g class levels. 
the number of cases in each absence category decreases as 
of class absences increases, a cut-off point had to be 
which would permit the greatest number of points to be 
on a scatter diagram without sacrificing the stability of 
ained point. It was found advisable to exclude from analy- 
absence categories which contained less than ten cases. Table 
licates the percentage of population used to establish scatter 
points on each class level after excluding from analysis 
ce categories which contained less than ten cases. In this 
tion, it should be noted that although an absence category 
excluded in one class level breakdown, the previously ex- 
$ © 
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TABLE 1.—ReLATIONSHIP BETWEEN CLASS ABSENCES AND QUALITY POINTS 


Class Levels aler e LON. a F 

Four Year 99.13 16 —.910** —.816 
Junior-Senior 97.38 12 —.962** —.918 

D Fæshman-Sophomore 98.89 15 — .936** — .867 
® Senior 96.54 9 | —.706* — 426 
Junior 95.13 12 — .902** —.797 
Sophomore 97.31 13 — .633* — 346 
Freshman 94.86 11 — .929** — 848 


==> * —.01 level of confidence. 
** — 05 level of confidence. 


cluded absence category can be included in the combined class 
level breakdowns owing to the function of summation. As a conse- 
quence, the r obtained for the four-year breakdown includes ab- 
sence categories ranging from no absences to sixteen absences, and 
excludes less than one per cent of the cases. In Table 1 the double 
asterisk indicates a .01 level of confidence, while the single asterisk 
indicates the .05 level. In the same table, obtained r’s were cor- 
rected for a small number of cases as suggested by Croxton and 
Cowden (1). 

In general, for all cases and on all class levels, the median ab- 
sence is smallest for A grade students and increases progressively 
for B, C, D, and F grade students. Table 2 illustrates the fore- 
going statement. It is noteworthy that the median absence of 
senior and junior level A grade students differs markedly from B 


TABLE 2.—Mep1an NUMBER or CLASS ABSENCES AccoRDING TO 
LETTER Grapes 


Letter Grades 


Class Levels ` 
c D F 
Four Year 1.82 | 2.50 | 3.26 | 3.85 | 5.19 
Junior-Senior 1.74 3.03 3.51 4.50 5.75 
Freshman-Sophomore 1.90 | 2.02 | 3.07 | 3.63 | 5.04 
) Senior 1.35 3.00 2.95 5.67 2.00 
Junior 219 | 3.06 | 3.45 | 4.25 | 6.00 
v Sophomore 211 | 2.24 | 3.38 | 4.05 | 5.40 
Freshman 1.62 | 1.63 | 2.36 | 3.06 | 4-78 
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students on the same levels. A similar situation does not ap- 

hold for Sreshman and sophomore levels. The apparent 
y of the senior F grade student’s absence median in 
results from the use of a very small number of unevenly 
uted cases. 


and grading leniency of an instructor. Coefficients of cor- 
tained for this comparison range from —.118 on the 
year level, through —.139 for the junior-senior level, to —.367 
 freshman-sophomore level. 


, DISCUSSION 


statistically significant inverse relationship is found between 
yer of absences and final grades, for all class levels and all 
tions of class levels. Although the relationship between 
y grading leniency and number of class absences is both very 
“and statistically non-significant, the extent to which each 
member uses number of class absences as a partial criterion 
igning final grades remains unknown. Theoretically, if all 
culty members weighted number of class absences as a criterion 
ong signing grades to the same degree, and if all faculty members 
‘Used identical systems of grade distributions, strong negative rela- 
p onsi should have been obtained. In brief, instructors who had 
'l test number of absences would have issued the lowest 
‘to their students. Actually, although the obtained coeffi- 
of correlation are negative, these correlation coefficients are 
very stong nor statistically significant. 
other factor which could contribute to the inverse relation- 
between quality points and class absences, is that of motiva- 
ack of motivation could explain the accumulation of a large 
of class absences. Moreover, if motiyation is wanting, & 
expends less effort on his studies and thereby earns lower 
es in his course work. As a consequence, whether one measures 
‘umber of accumulated class absences or the excellence of a 
dent’s grades, the same factor—motivation—may conceivably 
Measured in both cases. Two measurements of the same factor 
correlate highly with each other since both are obviously 
res of the same characteristic being compared with itealf. 
pproximation of the extent to which motivation may contri- ` 
class grades can be secured by relating the factor eye 
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significant relationship is found between number 6f class g 
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tion to those factors which well-documented research indicates are 
related to college grades. Possibly the most decumented factor is 
that of intelligence. Since numerous summaries of the relation- 
ships between intelligence tests and grades (3) for college students 
range from .20 to .79, with the modal coefficients ranging from .30 
to .5U, cognizance must be made of the intelligence factor and its 
contribution to the grade received by a student. This being the case, 
the factor of motivation can only contribute that portion to the 
class grade which has not already been contributed by intelligence 
and other factors. In order to support the assumption that the fac- 
Sor of motivation can account for the high coefficients of correlation 
which are obtained between number of class absences and honor 
point averages, the results obtained in this study must be recon- 
ciled with results obtained and documented in other studies. As- 
suming an absence of overlapping in correlations between intelli- 
gence and motivation, as well as an absence of correlation between 
class absences and intelligence, results obtained in this study would 
permit a range of correlation coefficients from .27 to .77 to exist 
between intelligence and class grades. Such a range of correlation 
coefficients a hes rather closely the previously indicated range 
of summaries of relationships between intelligence and college 
grades. Consequently, the assumption that motivation of a student 
can account for the high correlations found between number of 
class absences and honor point averages is not inimical to results 
obtained in other studies. 

The obtained results are obviously peculiar to the sample used 
and the methods employed in this study. However, to the extent 
that other studies, using other samples, and employing diverse 
experimental designs, obtain comparable results, an interesting 
possibility presents itself. Instead of spending countless hours in the 
construction, development, administration, reading, and grading 
of college examinations, an instructor could assign grades accord- 
ing to number of class absences and apparently achieve closely 
approximate results in most instances. 


SUMMARY 
Thirty-four college faculty members submitted a roster of 2654 
students who were enrolled in their classes. This roster indicated 
both the number of class absences and the final course grade for 
each stadent. » 
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grades were transformed into quality point equiva- 

an coefficients of correlation were derived by placing 
absences on the Y axis and quality point averages on the 
obtained coefficients ranged from —.633 to —.962, 


e 

ian number of absences for each letter grade category 
for all class levels. In general, for all cases and on all 
the median number of absences was smallest for A 
and it increased progressively through letter grades 


partial measure of the prevalency with which instructors 
the number of class absences for the determination of 

tades, a Spearman rank order coefficient of correlation was 

ranking instructors according to the median number of 

n their classes and the arithmetic mean of the final class 

. The coefficients of correlation obtained for this rela- 

ged from —.118 to —.367, and none were significantly 

traditional levels of confidence. 

o the number of accumulated class absélices and the 

ity point averages may both have a motivational fac- 

on, the obtained coefficients of correlation between 

class absences and quality point averages may be 

high. 
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BOOK REVIEWS 


Everurt L. Sxostrom AND Lawrence M. Brammer. The Dy- 
namics of the Counseling Process. New York: McGraw-Hill 
Book Company, Inc., 1952, pp. 213. $3.50. 
D 2 
This volume is a practical rather than theoretical discussion of 
the counseling process, with particular emphasis on the treatment 
of educational-vocational problems. The authors herein define 
counseling as “a purposeful, reciprocal relationship between two 
==} people in which one, a trained person, helps the other to change 
himself or the environment.” The stated frame of reference is 
‘self-adjustive’ with the aim of counseling being greater self-direc- 
tion and self-responsibility. This actually represents the philosophy 
and techniques of Rogerian client-centered therapy with some 
minor modifications. The modifications, for the most part, involve 
recommending greater use of psychological tests and inventories 
than is usually advocated by Rogers and his students. 

The goal of this ‘permissive or client-centered approach’ is “to 
facilitate the individual’s total maturity as well as’ focus on his 
immediate problem-solving processes.” Counseling is seen as & 
situation where the client “may reévaluate his experiences and 80 
bring the self into closer harmony with experience.” The self, in 
this case, is defined as “the individual’s dynamic organization of 
concepts, values, goals and ideas which determine the ways m 
which he should behave.” ; 

After discussing the self-adjustive approach in terms of Roger's 
notions of personality and growth, the authors turn their attention 
to actual counseling procedures. They discuss establishing readiness 
for counseling, the initial interview, the use of tests and occupa- 
tional information, and the synthesis (or terminal) interview. The 
final chapter is a discussion of the application of the client-cen- 
tered approach to classroom teaching, the organization of courses 
and their content, and concludes with a client-centered philosophy 
of education. / 

Shostrom and Brammer, throughout the text, stress the gapen 

° ant problem of dealing with the client’s attitudinal and emotion®® — 
„responses to counseling. Their use of pre-counseling group een 
» tion sessions to explain the goals of counseling and to help clien 
@ clarify their expectations about counseling seems to be a very uoe 
D j 250 
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. The section on interpreting test results to the client 
| done and may provide counselors with several help- 
es, 
hors strongly advocate the self-adjustive, i.e., client- 
roach as preferable to the directive approach, the 
ternative discussed. Their reasons for this posițjon 


ms of democracy and those of client-centered ther- 
d the same.” Implicit in this statement is the argu- 
democracy is the best form of government (and who 
against this point of view), then client-centered treat- 
e form of therapy. Meehl and McClosky in “Ethical 
ti al aspects of applied psychology” (J. abnorm. soc. 
, pp. 91-98) have pointed out the sophistry of this 
g and called for the use of empirically derived criteria to 
‘the efficacy of the counseling process. 
acy of this position by the authors also stems from 
ticipation in the Stanford Guidance Study, a research 
which is reported in some detail. This study seemingly 
criterion measures of the effectiveness of counseling: 
information about counseling procedures (the client's 
erentiate different types of tests; ability to name com- 
d tests; etc.), (2) feeling tone (how the client ‘feels’ 
he counseling experience; to be inferred from his verbal 
nts,), (3) self-direction (the client’s attitudes about him- 
la tion to a counseling experience; to be inferred from verbal 
nts). The authors claim that the results of the study dem- 
that the group treated by client-centered techniques was 
on the last two criteria to a group counseled by ‘more 
directive techniques’. ji 
y question the usefulness of these particular criterion 
and the authors’ conclusions, based on this single study, 
over-whelming advantages of the client-centered. ap- 
Other studies, using different criterion measures, have 
lite different conclusions. For example, in a recently 
(study, Klingelhofer (The Effects of Educational Counsel- 
cholastic Achievement of Probationary College Students. 
ed doctor’s dissertation, State University of Iowa, 1953.) 
a highly directive approach in counseling was efficacious, 
hprovement in the scholastic performance of collegg stu- 
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d ideological and empirical. They believe that “the , 
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. 
dents on probation as the criterion measure. A survey of the re- 
search literature in this area (unfortunately neglected by the 
authors) would scarcely permit unequivocal statements about the 
over-all superiority of one counseling technique over another at 
this time. 
® Dhis reviewer wonders about the audience for which this text was 
© written. If directed to the professional psychologist, then much of 
this material such as the discussion of the learning theories of 
Hull, Tolman, et al, will be quite low level. If, on the other hand, 
it is aimed at the guidance officer(s) in the secondary school or col- 
=== lege, then these discussions require more psychological training than 
such readers are likely to have. The problem of the audience seems 
especially acute as the authors do not discuss the problems of 
counselor training or counselor qualifications. This is especially 
unfortunate in these times of a growing professional awareness on 
the part of counselors. 

While they do point out that these counseling procedures are 
usually for the individual with ‘personal problems that we all have’ 
and that students with problems ‘too involved’ should be turned 
over to a psychologist, no operational definitions of such cases are 
supplied nor are the possible dangers involved in such cases dis- 
cussed. No differentiation is made between the counseling proce- 
dures recommended by the authors and those to be followed by 
the psychologist to whom the client is referred. Indeed, no men- 
tion is made anywhere in this volume of psychotherapy or of any 
of the other usual techniques that might be utilized by a clinical 
psychologist after referral. 

The volume is quite readable, very practical, and filled with 
many ‘helpful hints’ about counseling and interviewing. Therein 
lies the danger. A näive, unsophisticated reader may imagine that 
all counseling is as simple as the authors’ specimen cases and can 
be learned, ‘cook book’ fashion, with the present volume as the 
guide. It is extremely unfortunate that the limitations of these 
client-centered techniques and the necessary cautions that should 
be followed in all counseling programs are not more fully treated. 

Lronarp D. GoopsTEIN 


: State University of Towa 
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» 
R. F. Mater. Principles of Human Relations. New York: 
Wiley & Sons, Inc., 1952, pp. 456. 


n relations problems have received considerable attention 
ists and psychologists. Maier’s book presents prin- 

rocedures that make important contributions to #slu- c 
i these problems. The book is designed to be used by psy- . 
and by industrial personnel who recognize the problems 
but have encountered resistance to the development 
human relations. It may not have as much appeal to in- 
sts who do not recognize existing problems or to those who «= 
jt had enough psychological training to recognize the value 
nciples and methods presented. Certainly, trainers look- 
recipe book’ will be disappointed. However, Maier’s book 
more than enough material to develop an excellent train- 
if the builder is prepared to exploit what is offered and 
the program to the needs. 
design of the book may have placed some limitations upon 
Maier says: “Basically, supervisory training is a preventa- 
m in the control of attitudes.” The discussion is confined 
ciples and methods for the changing of attitudes. The train- 
pervisors, on the job, is mainly a problem of this kind. 
the training of employees to take new supervisory jobs 

e the development of new concepts and new behaviors 
the changing of attitudes. Maier recognizes that new be- 
we to be developed from time to time, but he mentions 
e usual training methods are adequate for this and goes no 


eration of the content leaves little doubt that the book 
‘important aid. Although the chapters are not grouped 
s, there is significant order given to their arrangement. 
uction clearly sets up a philosophy of training for better 
ations in industry. In Chapter Two there is an outline 
ing program. The next chapters present methods to be 
aining programs and group activities. Discussion methods, 
, and a group decision procedure are exceptionally well s 
. Chapters Eight and Nine contain case studies which are 
of the content found in actual situations where the 
ve mentioned methods were used. The last two chapters depart © 
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from group participation and discuss, respectively, the supervisor's 
dealing with individuals and non-directive counseling. j 

The author has earned considerable respect for his work and 
writing. The present book will not lessen this respect. He has writ- 
ten for all persons interested in human relations problems and all 
of these people should have his book available for reference. The 
book is valuable because it offers so much that has been subjected. 
to empirical trial and that is so clearly presented. The best hu- 
man relations programs will always involve much of what has 
been dealt with in this book. 


Duane C. SHaw 


Carnegie Institute of Technology 
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GROUP-DISCUSSION METHOD OF * 
TEACHING PSYCHOLOGY’ 


ALBERT EGLASH 


Mayor's Rehabilitation Committee, Detroit, Mich. 


d White’s (3) experiment on democratic and auto- 
hip of boys’ clubs is the indirect source of the teach- 
e used in this investigation, and is the direct source of 
nental design. The direct source of the teaching method 
on supervision in industry, given by N. R. F. Maier at 
érsity of Michigan, in which the principles applied by Lip- 

9 clubs are applied to industry. Maier defines democratic 
p in terms of the leader offering the group problems rather 
itions, Within the limits of his area of freedom, the demo- 
Teader allows the group to make its own decisions. 

directly through his own writings and indirectly through 
minar, Carl Rogers” (4) nondirective philosophy has played 
ential rôle here. In addition, this approach is related ideo- 
to Anderson’s (1) studies of integrative and dominative 
‘in classrooms. 
experiment! has been designed to answer these questions: 
will achievement in a group-discussion class compare 
in a lecture class? 
will students’ reactions in a group-discussion class com- 
those in a lecture class? 


i PROCEDURE 
classes were in Introductory Psychology, and were taught 


on and lecture methods of teaching. In both classes 
’s reading assignment was given on the first day of class. 
ed at Michigan State College. etait 
257 GAE 
LS 


instructor, who has had the same amount of experience Š 
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At the end of each two-week period, the same multiple-choice quiz 
was given to each class. These quizzes included both text and class 
material. In addition, each class was given the same comprehensive 
departmental final. On the last day of the term, each section was 
asked to complete a Teacher Evaluation Sheet used by the college. 
An attempt was made in both sections to keep the emotional at- 
® ‘mosphere warm and permissive. The instructor allowed himself to 
be the same person in each section—informal, considerate of stu- 
dents’ needs, undogmatic, i.e., quick to admit his own lack of knowl- 
edge or conviction, and with humor directed more towards him- 
If and his profession (teacher, psychologist, scientist) than 
Oasis the students. He felt, in both sections, that he had gotten to 
know some of the students as individual personalities, and that a 
very human relationship had been established between himself and 
the class. Students in both sections were given an opportunity 
to criticize the conduct of the course and to offer suggestions for 
improvement. In both classes some of the suggestions were utilized. 
In short, the instructor attempted to keep the classes identical 
except with respect to teaching method. However, the frequency of 
meeting differed: The discussion class met for one hour a day in the 
afternoon, four days a week; the lecture class met for two hours an 
evening, two evenings a week. While one might expect the eve- 
ning class to have more adults, casual observation suggested that 
both classes were composed almost exclusively of students of the 
same age group. Mean ACE scores and grade-point averages were 
almost identical in the two sections (respective CR’s are .38 and 
-64). Each section held approximately fifty students. 


DISCUSSION CLASS 


On the first day of the quarter, the students were told that this 
would be a discussion class, and that no lecture material would be 
given. They were asked to form discussion groups of approximately 
six students each, and to select a discussion leader. Each group then 
proceeded to discuss a series of questions given them by the Im- 
structor, questions relating to the conduct of the course: number, 
kind, and weighting of examinations; extent to which class-attend- 
ance and participation should influence the grade; topics they 
wished to take up. After a twenty-minute group-discussion period, 

5) ` each group leader reported his group’s decision to the class, and the 
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s decision, based on these eight group decisions, was 

icly made. 

e method was used in order to cover the content of the 

, the beginning of a two-week period, each group was 

t ewritten question, and the groups were given time in 
cuss their respective questions. After a twenty-minute 

on period, each leader reported to the class the ideas @ 

within his group. The class as a whole would then dis- 

uup’s ideas. t 

structor’s rôle was one of listening, trying to understand 

t as it was made, occasionally reflecting or summarizing eam 

said, sometimes integrating one comment with another, 

the various points on the blackboard. Incorrect informa- 

critically accepted and listed. Requests for facts or for 

were handled by the instructor offering the question back 

class. While this technique resembles one which Faw (2) has 

|, in that the discussion is student-centered, it differs in two 
The discussion is oriented primarily around questions 

y the instructor and 2) Intellectual content rather than 


t on the board, and each discussion group recommended a cut- 
int for each letter grade. After the class decision had been 
ch individual was notified of his own score. After each 
iscussion groups were given the task of offering sugges- 


LECTURE CLASS 


cisions concerning examinations which had been made by 

on section were imposed on the lecture class. The ma- 
ng out of class discussions was offered to the lecture class 
5 of lecture, movies, and other demonstration materials. 
ne number of A’s, B’s, etc., which the discussion class gave 
es after each quiz were given to the lecture class. 


- 


RESULTS 


hievement—There are no consistent differences either in means e 
ces, and what differences there were were not statisti- 
le. On the first quiz, the examination questions and fhe i 
questions were on the text material. The discussion 
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group did somewhat better. During the second two-week period, 
the discussion questions related to the groups’ topics. For example, 
one group had expressed an interest in doodling. Since the text 
assignment was on sensation, they were given the question, “What 
senses are involved in doodling and finger-painting?” Here the 
trend is sharply reversed. As in the other quizzes, we find little 
difference in achievement on material which, growing out of class 
discussion, was presented by lecture to the control group. 

During the third period, the discussion questions were on class 
procedure, e.g., “What are the things you like and dislike about this 


sx=method of teaching elementary psychology?” Achievement on text 


> 


material, which was not mentioned in the discussion class but which 
was explained to the lecture class, is equal for the two groups. 
During the fourth period, as during the first, the discussion ques- 
tions related to text material, though less directly than before. As 
on the first quiz, the discussion class did better on text material, 
but did no better on discussion material. During the fifth two-week 
period, the discussion questions again related to the text material. 
Here the two classes are equal on both text and class material. 

The final examination failed to indicate any difference in achieve- 
ment between the two classes. 

Aititudes.—(a) Quantitative: As shown in Table 1, the five-point 


TABLE 1.—TEACHER EVALUATION SHEET 


F 
Mean scores Variance 
Scale cR gieus 
Dia Lecture i “tare 
1. Were important objectives met? 3.3 | 4.2 | 5.20** 2.64%" 
2. Did the instructor’s presentatioñ en- | 2.4 | 3.6 | 5.45** | 1.86 


hance learning? 


8. Is the instructor’s speech effective? 3.5 3.8 | 1.76 1.12) 

4, How well does he work with students? | 3.6 3.9 | 1.20 | 1.91 

5. Does he stimulate independent think- | 4.1 | 3.7 | 2.00* | 1.12 
ing? 

6. Do his grading procedures give valid | 3.8 | 4.2 | 1.82 | 1.47 
results? i 

7. How does he rank with other instruc- | 3.1 | 3.9 |4.00** | 1.42 
tors? 

— 


* .05 level of confidence. 
** Ollevel of confidence. 
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Teacher Evaluation Sheet indicate differences both in 
and in variances between the two groups. The lecture class 
ore than did the discussion class, that the important object- 
‘the course had been met; that the instructor’s method of 
g the material enhanced the learning experience; that the 
works well with students, i.e., they feel free to come to 


should be ranked above the average instructor in the col- 
Only with respect to his ability to stimulate independent 
g did the discussion group rate the instructior higher than 


alitative: There is a place on the Teacher Evaluation Sheet 
ts. The following is a composite, largely in the students’ 
ords, of both favorable and unfavorable comments in each 


favorable comments made by the discussion class—“We should 
been warned at registration that this was going to be a dis- 
ion class, This shows the drawbacks of modern education. Stu- 
don’t know what they want, yet they run the class.” 

don’t have enough background to know when we are right 
rong. Instead of giving us this background, the instructor just 
ds in front of the class, and leaves us up in the air, confused, 

g, with nothing but a bunch of mixed-up facts and no con- 
ideas. Too much time is taken in discussing procedure rather 
Subject matter; the discussions were irrelevant and uninterest- 
We learned nothing from them. The whole thing is a farce, a 
of time and money, and a poor preparation for future psy- 
y courses. We can get as much out of not coming to class. 
instructor is failing to accept ‘his responsibilities as a teacher.” 
$ e text assignments are too long, the text is inappropriate for 
sind of course, the type of test given is invalid for the discus- 
method, and some students who either don’t attend class or 
‘participate still get the best grades, because no credit is given 
participation, ”* 
s method won’t work unless we are brought up in this sys- 
d are used to it, and unless everyone coöperates. It allows 

h independent thinking.” e 
@ instructor makes me feel like I’m being psychoanalyzed; « 
bably writing a book, and we're his guinea pigs.” © 
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extra help; that the grading procedure is satisfactory; and © 
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Two comments made during class discussions should also be 
added here: 


“We work hard for our money to pay tuition here, and you're — 


cheating us out of that money. The college pays you a salary to 
teach us, and you’re failing to do what you’re paid to do.” 
“You preach democracy to us, and then you don’t practice it, 


® You impose a method of teaching on usthat we don’t like—we came 


here to learn psychology, not democracy—and then when we vote 
that you should lecture to us, you refuse.” 
Unfavorable comments made by the lecture class —‘The instructor 


=<sseldom smiles, and seems unhappy; he is unsocial, withdrawn, im- 


personal, difficult to know, so objective that one never gets to know 
his opinions, lacking in self-confidence, and unwilling to assume the 
leadership of the class, being too lenient in dealing with noisy stu- 
dents, too democratic. He doesn’t exercise enough discipline, and is 
inclined to allow unnecessary discussion. He should challenge stu- 
dents when they say something, and should present his own views 
more; we don’t know when we’re accurate.” 

“Sometimes the lectures and other materials are not well organ- 
ized, and do not clarify what is in the text or what is on the test.” 

(And, by a male:) ‘He shows sex discrimination.” 

Favorable comments made by the discussion class—‘‘The class was 
democratic, stimulating us to think on our own. This is a stimu- 
lating change from the class lectures I usually sleep through, and, 
in place of my usual apathy, makes me think. The students teach 
themselves.” 

“Codperative student-teacher planning of the course is a good 
idea. It’s only right that students have something to say about the 
way the class is run. We take up topics we choose ourselves, and I 
like the freedom from restraint.’” 

Favorable comments made by the lecture class —“The instructor 18 
informative, presenting an abundance of material so that it can be 
readily absorbed. He covers the material well, explaining it clearly, 
without going in circles, and makes it interesting. He is well-versed 
in his subject, bringing in the latest developments. The examina- 
tions are fair,” : 

* “The frequent breaks make for a relaxed atmosphere; this was 
mere democratic than most my classes.” 
» “The course has helped me a great deal; I have developed a 
deeper insight into people as individuals through this course. 
w] 
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ctor is conscientious, going out of his way to help 
| He is unusually sensitive to interpersonal relationships, 
you feel that he is talking to you, that he is interested 


d, by a female: “Perhaps it’s because he is young, but I feel 
ch at home around this instructor. I’m in favor of younger, 
odern and informal instructors. ...” 
om behavior: In the discussion class various reactions 
stration were noted. Much aggressive behavior was expressed, 
usually verbally, occasionally in other ways. On one occasion, when 
ctor asked the class what they would like to do with thessa 
ning in the class period, one student got to his feet, an- 
Tsuggest I leave and go study my physics,” and marched 

ng the door. An indication of the tone which some- 
vailed, especially during the bimonthly gripe sessions, ` 
Seen in comments offered after a particularly traumatic 
the instructor’s view) session: “How can you just stand there 
t?” “Boy, I’d hate to be in your boots.” At no time did 
ctor notice aggression expressed by students towards stu- 


frustration reactions also occurred. One student declined 
group. Others would not participate in their group’s dis- 
, but held aloof, sometimes read. 
enever the students were offered the opportunity to make 
ions for improving the conduct of the course, within the limi- 
that the instructor not lecture, the most frequent suggestion 
‘was that the instructor lecture. Students who called atten- 
this fixated behavior termed it ‘dependency’. 
responding behavior was noticed in the lecture class. When 
or suggestions were invited, minor suggestions—e.g., 
taking a break during the two-hour period, or about the in- 
Uctor’s handwriting—were offered. 


$ CONCLUSIONS 


weni—The method of teaching did not affect achieve- 
type of examinations currently used in the course. 

esults do not support the view, expressed by the students, 
instructor is failing in his duty to the students and in þis 
ities to the college if he merely leads discussions and does «< 
information. The results do not support the gjudents’ 
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view that, for effective learning to occur, they need explanation 

from the instructor. As Faw remarks: “There is certainly no reason 

to believe, judging from the test results, that the student-centered 

group was handicapped intellectually. The widespread observation 

on the part of the students that the student-centered group discus- 

sions,resulted in less of an accumulation of facts and information 
® was not borne by the facts... .” (2, p. 108). 

2) In other words, the results do not support the view that learn- 
ing is more effective when an instructor speaks, outlines, and illus- 
trates with movies and demonstrations, than when he contributes 

#<gpnly an opportunity for the students to discuss the material—or 
not even this: During the third two-week period, the discussion 
bore no relation to the text or test, but the discussion class did as 
well as the lecture class. Apparently the instructor’s explanations 
and demonstrations were a difference which made no difference. 

3) The students’ claim that they lack the background necessary 
for a creative discussion is not supported. As Faw states: “Actually 
the discussion brought out a wealth of facts and information which 
were authentic and verified. In many instances the instructor felt 
that the presentation of facts by students was as comprehensive 
and was much more interesting than if he had made the presenta- 
tion” (3, p. 109). 

Some degree of validation for this view may be seen in the fact 
that, when the discussion material was presented to the lecture class, 
it was favorably received as meaty and relevant. This favorable 
reception was reflected both in the class response at the time and 
on the Teacher Evaluation Sheets. 

4) On the other hand, the results do not support the view, held 
by the writer, that giving the students the opportunity to think 
and to express their ideas will make for better learning than will 
forcing the students into a more passive, receptive rôle, in which 
they memorize facts. On their own discussion material, the dis- 
cussion class was not consistently better than the lecture class. Our 
learning principles Say that recitation is more effective than memo- 
rization, but the principle seems not applicable to this classroom 
situation. f 

' 5) One would expect that throwing the students on their a 
might stimulate some and frustrate others, and that this wo F, 

, Cause a greater spread of achievement scores among the studen 
in the djscussion class, This did not occur. : 
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_ ATTITUDE 


he method oftteching did affect attitudes towards the course. 
here is a difference between one’s achievement and one’s 
evement. Faw also found this “discrepancy between 
ents feel about their accumulation of knowledge and 
results” (2, p. 109). The discussion class felt that*they 
little; the lecture class felt that they had learned a 
‘As measured by examinations, both learned an equal 


ents’ feeling of satisfaction, of liking or disliking a course, 
nction of methodology or technique of teaching, as well a 
ctor’s personality and the course content. This differ- 
a difference. 
ne would expect that throwing the students on their own 
stimulate some and frustrate others, and that this would 
è à greater spread of attitude scores among the students in the 
ion class. This occurred. 
Even apart from the difference in the degree to which each 
liked the course, there was an important difference in the type 
action of the two classes. In the lecture class, the students took 
evaluative attitude towards the person of the instructor, com- 
ting both favorably and unfavorably upon him. In the discus- 
m class, the students seemed simply to accept the instructor, and 
onded largely to the method of teaching. In other words, in the 
on class rejection of the teaching technique was not accom- 
by rejection of the instructor as a person, and enthusiasm 
€ teaching technique was not accompanied by any transfer- 
of this positive feeling to the person of the instructor. 


CRATIC AND AUTOCRATIC LEADERSHIP IN THE CLASSROOM 
hile the psychologist may define democracy and autocracy 

S of whether the decision-making function lies with the 

p or remains with the leader, the students are likely to define 
Concepts by other criteria: t 

) Are we pleased? The group-discussion class was perceived by 
nts as an attempt on the part of the instructor to force 

g upon them. The lecture class was perceived by the stu- 
being both factual and entertaining, as serving their needs 
‘matters as breaks during the lecture period and generosity < 
ades, and as meeting their expectations. © 
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(b) What kind of attitude does the instructor have? The students 
perceive as democratic any instructor whô allows them a certain 
amount of freedom in such matters as reading newspapers or whis- 
pering while in class, who is courteous, relaxed, humorous, who 
states things in a conditional, non-authoritarian manner, “He has 
coffee with students, and learns their names.” “He’s willing to take 
time to chat with them after class.” “His decisions are based upon 
what he believes is best for them, rather than upon what is most 
convenient for himself.” 

This was the perception of the students in the lecture class, In | 
œŒ the discussion class, the instructor never used the term ‘demo 

cratic’. Nevertheless, from the first week the students spontane- 
ously referred to the class as democratic. Having so defined the 
class, they proceeded to: 

(a) Project this definition onto the instructor, i.e., they insisted 
that he had defined the class as democratic, and that he had been 
preaching democracy to them; 

(b) Chide him, whenever he refused to yield to any student de- 
mand, with failing to live up to his preachings. X 

2. There is no evidence here that students prefer a group-discus- 
sion and group-decision method, or that they like integration and 
resent domination. Even in the vital matter of grades, the class 
who decided for themselves the proportion of A’s and B’s were lest 
satisfied than was the class in which the instructor made the deci- 
sion. 

3) Defining type of leadership in terms of decision-making, Wë 
find no evidence here that autocracy produces more aggression 
than does democracy. : 

4) Finally, there is no evidence here that a leader best imple- 
ments the type of instruction in which he believes (4, p. 425). This 
instructor, while enjoying both methods of teaching, strongly Pre 
fers the group-discussion method of leadership, whether in the class- 
room, in industry, in hospital and prison administration, or in y 
other situation. He finds the client-centered philosophy, the non- 
directive technique, congenial, Nevertheless, he apparently doe’ 
a better job as a directive leader. J It 

This experiment was carried out during the Winter pee 
was repeated, with essentially similar results, during the Spring 

» Quarter. During the Summer Session following these two quarter 

the waiter mailed to all four groups of students an objective exami 
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E genn, and a course evaluation scale. The 
o: also given to the Summer Session stu- 
ion of returns from each of the four post-final 
‘approximately one-third. A comparison of Winter, 
er students failed to show any evidence of post- 
Equality of achievement between lecture anddis- 
remained. However, when the course was over and 
‘in, the two types of class, discussion and lecture, 
equally favorable attitude towards the course. 


the emotional climate similar in the two classes, an in- 


by lecture. The group-decisions were used in order 
the conduct of the course in such matters as examina- 
, and also to cover the course content. Following 
f Lippitt and White, the instructor imposed the same 
the control class, and covered the same material by 
es and demonstrations. 

t on course content was not significantly different. 
le of the lecture class, however, was significantly higher 
Of the ‘democratic’ class. 
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STUDENT ATTITUDE AND ACHIEVEMENT N 
A COURSE IN INTRODUCTORY STATISTICS 


A. W. BENDIG and J. B. HUGHES IIT! 
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In faculty conversations concerning the teaching of psychology, 
the viewpoint is often expressed that instructors of courses in intro- 
ductory statistics are faced with the special problem of the emo- 
tional attitude of the student toward the course. A large number of 


© S® students seem to fear that they will be unable to learn the content 


matter of the course, either because of an inadequate background 
in mathematics or because of an emotional hangover acquired from 
previous somewhat traumatic exposures to high school and college 
mathematics courses. If this problem really exists, it has failed to 
be recognized in the elaborate recommendations about undergradu- 
ate training in statistics that have been sponsored and reported by 
the American Statistical Association (3, 12, 13, 14). Attention is 
paid to course content and structuring, and to the required edu- 
cational background of both instructors and students. However, 
little attention is directed toward the effect of the mental attitude 
of the student in the biological and social sciences, who is often Tea 
quired to take such a course as part of his undergraduate training 
(1), upon his academic achievement in statistics. The results of 
Margaret and Sherriffs (9), indicating that the attitude of students 
toward psychology is materially related to achievement in the in- 
troductory psychology course, suggest that a similar relationship 
may be present in the introductory course in statistics. 
The problem seems to be threefold: (a) can such an attitude be 
reliably measured, (b) if so, is this attitude related to student 
achievement in the course, and (c) what are the significant con- 
comitants of such an attitude. The purpose of this paper is to report 
the results of a preliminary investigation of these questions. 


THE ATTITUDE SCALE 


In the Fall, 1950-51, semester one class period of the senior 
author’s course in introductory psychological statistics was devo! 


> ! The authors wish to express their appreciation to Dr. Henry Zagorski 
and Mr. H. J. Maginley for their coöperation in obtaining a part 0 
data for this study. 
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sion of student attitude toward the course. The in- 
presented to the students his observations and hypotheses 

ing student attitude and course achievement and encour- 
yerbalizations by the students of their fears about what would 
ired of them in the course. The students were asked to write 
short paragraphs on their attitude and anonymously urn 
to the instructor at the next class period. Parenthetically, © 
t be noted that the reaction of the students was both enthus- 
nd helpful. Following the collection of the paragraphs, fifty 
ly oriented statements were derived from the paragraphs 
g all aspects of the attitudes reported by the students. For «> @ 
ple, three of the statements were: 

we less math ability than most of my friends.” 
| probably never use what I learn in this course after I gradu- 


fy grade in this course will probably be closer to a D than to 


fifty statements were mimeographed as a test booklet with 
uctions requesting the students to rate each of the statements 
| five-point scale as to how well the statement reflected their 
about the course. This test was administered to all students 
oductory psychological statistics classes at the beginning of 
following semester: Spring 1950-51. In addition, the letter 
Teceived by each student was obtained at the end of that 

A total of fifty students took both the attitude test at the 
ng of the semester and received a completed letter grade at 


è first step in the analysis of the data was to dichotomize each 
the fifty items on the original form of the test were analyzed 
d the number of students selecting each of the five rating 
es on each item. Each item was then dichotomized by com- 
g adjacent response categories to achieve, as closely as was 
e, a cutting point that would divide the respondents on that 
into two equally sized groups. On most of the items approxi- 
half of the subjects chose the first, most favorable choice 
the remaining subjects spread over the other four, less favor- 
tings. The combined favorable ratings for each item were 
La weight of one, the combined unfavorable categories a weight 
¥ero. A scoring key for the combined favorable choices on the c 
ms was made and each of the answer sheets scored, each 
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subject receiving as a raw score the number of keyed favorable 
responses that he made. These raw scores ranged from four to 
forty-eight out of a possible fifty. Each item was then correlated 
with the total score on the test to get estimates of item reliability. 


i These item reliabilities ranged from 0.07 to 0.87. 
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estimate of the validity of each item as a predictor of achieve- 
ment was obtained by correlating the item with students’ letter 
grades in the statistics course. The five letter grades were dichoto- | 
mized by combining the A and B students into one group, and the | 
C, D, and F students into a second group. The letter grade cri- 
terion used here was found to have a reliability of 0.73 in a previous 
investigation (1). Phi correlation coefficients were computed be- 
tween the dichotomized letter grade criterion and each item. These 
correlations ranged from —0.17 to 0.44. The two correlation co- 
efficients for each item (reliability and validity) were each weighted 
by the standard deviation of the item, graphically plotted, and the 
thirty items having the highest reliability and validity selected for 
the final form of the test, using the selection technique outlined by 
Gulliksen (7, pp. 382-385). A revised test key containing these 
thirty items was used to rescore the answer sheets of the original 
group of fifty students (Sample A) and the results are reported in 
Table 1. The reliability of the new key, using Kuder-Richardson 
Formula 20 (7, p. 224), was found to be 0.96. Raw scores from the 
new key were correlated (by the usual product-moment method) 
with quantified letter grades (A being assigned a numerical 
value of 4, B a value of 3, etc.) and a correlation of 0.24 was found. 
Such a magnitude of correlation indicates that approximately five 
per cent of student achievement in Sample A could be accounted 
for by variation in student attitude as measured by our test. J 
Cureton (2) has amusingly pointed out the fallacy in validating 
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Taste 1—Resvuits or Two SAMPLES OF STUDENTS ON THE STATISTICI 


COURSE ATTITUDE SCALE 


Original Sample Validation Sample 
MORESEE 


Number of Students 50 a 7 
Mean 17.48 pe, 
Standard Deviation 9.54 a: 
` Reliability (KR-20) 0.96 Se 
Validity 0.24 0. 
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y and validity of the new thirty-item attitude scale, 
given to all students in introductory psychological 
classes at the beginning of the Fall, 1951-52, semester. 


ments will be reported in a later section of this paper. A 
ity-one students completed all three measuring instru- 
also received a complete achievement grade for the 
results on the attitude scale for this group (Sample B) 
found in Table 1. The reliability of the test (again using 
rdson Formula 20) was 0.90 and the product-moment 
‘between raw scores on the test and quantified letter 
0.21. The differences between the pairs of reliability 
a 7 coefficients were tested for statistical significance by 
z technique outlined by Edwards (4, pp. 131-132). The drop 
ibility of the scale in Sample B (0.06) was significant at 

of confidence (t = 2.49), while the decrease in the 
( cient (0.03) was not significant (t = 0.17). While scale 
y Was significantly lower in Sample B, its magnitude (0.90) 
hat the thirty-item attitude test remains a reliable instru- 
the new sample. The similarity in validity coefficients in 
ples leads to the conclusion that there is a consistent 


CONCOMITANTS OF ATTITUDE 


ntioned above, the seventy-one students in Sample B had 
, in addition to the attitude test, the Kuder Preference 
biographical data sheet. The raw scores from the nine 
on the Kuder were converted to stanine scores, based 
raduate norms of University of Pittsburgh students as 


y the students on the biographical information sheet six 
that were hypothesized as possible concomitants of atti- 
rived and subsequently dichotomized for convenience 
F . These variables were: ti 
[ajor subject—Students majoring in psychology were given a 
cal weight of one: non-majors were weighted zero. œ 


EARI 


fe pe 


ad by the University Testing Service. From the personal data 


ch student was given the Kuder Preference Record |. 
jographical information blank. The data from theseJast ’ 
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Sex.—Male students were weighted one: female students zero, 

Amount of high school mathematics —Students taking 2.5 or more 
years of mathematics in high school were weighted one: those with 
two years or less were weighted zero. 

Amount of college mathematics—Those having taken one or more 
semesters of college mathematics courses weighted one: students 


© not having had any college mathematics training were weighted 
zero. 
Number of previous psythology courses—Three or more semesters 
of psychology was weighted one: one or two semesters weighted 
© æ Zero. 


College class.—Senior, graduate, and non-degree students were 
grouped and weighted one: sophomore and junior level students 
were given a weight of zero. 

The Kuder stanine scores were correlated with raw scores on the 
attitude scale by the conventional product-moment method. Point- 
biserial correlations (6, p. 329) were computed between the con- 
tinuous scores on the attitude scale and the dichotomous Major 
Subject and Sex variables. The correlations between attitude scores 
and the remaining four variables were computed using the biserial 
correlation formula (6, p. 324), since a dichotomy had been arbi- 
trarily imposed upon these continuous variables. These correlations 
are reported in Table 2. Four of the fifteen variables demonstrated 
relationships with attitude that were significant at the .01 level of 
confidence. The Kuder Computational scale, amount of high school 
mathematics, and amount of college mathematics were all posi 
tively related to a favorable attitude toward the course. Number of 
psychology courses was negatively related to a favorable attitude, 

The final question asked of the data was what was the minimum 
number of variables that would best predict student attitude and 
how much did each of these variables contribute to total attitude 
variation. The simplified technique reported by Gengerelli (6) was 
used to select the five most important predictors of attitude out 0 
the total group of fifteen variables. The five selected included the 
four variables reported above as showing significant covariation 
with attitude, plus the Kuder Persuasive scale whose cone 
with the attitude scale (22) fell just short of statistical seni 
(.23 is significant at the .05 level with seventy degrees of fr eee 
From the correlations of these five variables with the attitude © 


terion, and the intercorrelations of the variables, multiple regret 
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Correlation | Beta Weight | Per cent of 


0.16 jd 
0.31* 0.32 10, 
0.20 
0.22* 0.61 13 
0.02 
0.03 
0.08 
0.07 
—0.04 
—0.21 
0.13 
0.39* 0.30 12 
0.40* 0.28 11 
—0.42* —0.54 
0.15 


igh School Math 
ollege Math 


cant at the 0.01 level. 


cients were computed using the method detailed by Pe- 
Van Voorhis (17, pp. 226-229). These coefficients are re- 
as Beta weights in Table 2. The multiple correlation (cor- 
or shrinkage) between these five predictors and the attitude 
m was .82. The final column in Table 2 gives the multiplica- 
oduct of each variable’s correlation with attitude and its 
ht. Guilford (6, p. 432) suggests that variable weights, 
in this fashion, indicate the percentage of total criterion 
n that is accounted for by each predictor variable and that 
of these weights (actually the squared multiple correlation 
gives the percentage of criterion variability predicted 
entire battery of predictors. The weights found in Table 2 
Kuder Computational and Persuasive scales, amount of 
ol mathematics, and amount of college mathematics, each 
for approximately twelve per cent of attitude variability, 

number of psychology courses predicts almost twice the vari- , 

(twenty-three per cent) represented by any other single pre- 

i & 
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dictor. The battery of five variables shows covariation with sixty: 
nine per cent of the criterion variability, . 


DISCUSSION 


In terms of the original three questions proposed at the beginning 
of this paper we now have some tentative answers. A thirty-item 
© test was constructed to measure students’ attitudes toward an it. 
troductory statistics course designed for students majoring in pays 
chology and related biological and social sciences. This test showed 
a satisfactory magnitude of reliability (.90) when given to a new 
© @sample of seventy-one students, The empirical validity of the test ia 
actually measuring ‘true’ student attitude is as yet undetermined, 
but the rational or face validity inherent in the technique by whieh 
it was constructed suggests that such an attitude dimension i 
actually being tapped by the test. 

Student attitude measured by the test has been shown in two 
samples to account for four to five per cent of student variability 
in course achievement. Theoretically, such a finding corroborates 
the hypothesis voiced by instructors of introductory statisties 
courses that student attitude is a partial determinant of student 
achievement. Practically, this finding suggests two recommenda- 
tions: (a) instructors of such courses should attempt to modify 
and allay the fears of their student early in the course if maximum 

+ student achievement is the academic goal, and (b) any battery of 
measuring instruments devised to predict achievement in simila 
courses should include some personality or temperament 
device that can be logically expected to tap this attitude dimensioa. 
Such an attitude scale can be expected to predict a portion of tbe 
criterion variance not measured by the aptitude testa most com 
monly used in such prediction, 

Tt seems quite logical that increased exposure to mathematic 
courses in high school and college, plus an interest in computational 
pursuits, should dispose a student toward a statistics course. Such # 
background gives the student a familiarity with, and a liking fof 
intellectually handling, the quantitative concepts so important * 
part of statistics. In passing, it may be noted that Kuder (8, p- 5) 

“lists the occupations of Statistician and Statistical Clerk ss - 
showing major interests on the Computational scale. The relation 
á persuasivo interests to attitude is less clear, but it can be pointed 
out thgt a course in statistics is only partially quantitative in 
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ion, Kuder (8, p. 7) lists both Advertising Statistician 
Statistician occupational groups as showing high in- 
h the Computational and Persuasive scales. ‘The some- 
result reported in Table 2 is the strong negative reln- 
n number of psychology courses and favorableness of 
ch a finding may be explained by the type of stydent 


than two semesters of psychology are undergraduate 
y majors (the correlation between Major Subject and 
‘of Paychology Courses was .56, the highest intercorrelation 


‘as one of the departmental prerequisites for graduation, 
with few previous psychology courses are typically 
in some other field who take the course as an elective sub- 
the graduation and possible later admission to graduate 
he student who is majoring in psychology is partially, at 
odent upon his achievement in this course, we might 
expect him to enter the course with more fear and trem- 
‘the non-major elective student. Another possible expla- 
ne which the authors indignantly reject) is that our 
majors are refugees from quantitative thinking and de- 
their antipathy toward a forced exposure to such training 
itive attitude toward the course. The first explanation of- 
as much more plausible (and ego-placating) to us than 
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IINING STUDENTS’ CONCEPTS OF 
IVE TEACHING FROM THEIR 
j RATINGS OF INSTRUCTORS 


WILLIAM E. COFFMAN! è 


Educational Testing Service 


the twenty-seven years since the use of student ratings 
ctors first appeared in the literature, discussion has 
around two viewpoints. Critics of the use of student rat © 
‘pointed to the inadequacy of the student as a judge of 
ility, emphasizing his lack of experience and the diffi- 
reporting judgments which are free from subjective bias. 
of student ratings, on the other hand, have tended to 
that effective learning results from the interaction of 
nd teacher and that however biased ratings may be, they 
ble as a source of information concerning student reac- 
the behavior of teachers. 
le evidence has accumulated concerning students’ abil- 
tate instructors, and more and more institutions have 
ed to student ratings for one purpose or another. (6) Well 
studies have demonstrated that student ratings are re- 
(6) Others have provided evidence that ratings on multiple 
s need not be influenced by halo. (3) At least one study 
n that student ratings are similar to ratings of alumni 
years of seasoning. (2) 
ither party to the controversy is content with demonstra- 
f reliability, absence of halo, and conformity to judgments 
individuals who are not teachers. Those who question 
ts’ ability to rate their instructors require proof that student 
e sound before accepting the validity of ratings; those who 
understanding students need the same information as a 
interpreting ratings. 
5 report covers one aspect of a general investigation of Student 
f Instruction conducted under a grant of the Research Foundation « 
na A. and M. College. The writer wishes to acknowledge his debt 
tructors and students who wholeheartedly codperated in the in- 
and to Educational Testing Service for help in preparing the e 
i è 
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A recent study conducted at Brooklyn College (7) approached 
the problem by asking students directly to select from a list of ten 
the three items which they thought most important to good teach- 
ing. The method raises certain questions. Did these ten items cover 
all of the aspects of good teaching which the students might 
value? Were students influenced in their responses by cultural ster- 
eotypes concerning the nature of college teaching? 

An opportunity to approach the problem of student values in a 
different way presented itself when approximately two thousand 
students at Oklahoma A. and M. College rated fifty-five teachers 


®© Ss part of a comprehensive investigation of the Oklahoma A. and 
M. Rating Scale for Instructors. The Rating Scale consists of eight- 
een specific traits and a general estimate of the instructor’s effec- 
tiveness. For each rating the student is asked to check a graphic 
scale upon which are placed fiye descriptive phrases. The traits 
and descriptive phrases are: 

1) Preparation for Class Meetings: Class meetings very care- 
fully planned. Usually well prepared. Preparation often inadequate. 
Little preparation. No preparation. ; 

2) Teacher’s Interest and Enthusiasm in Subject: Very enthusi- 
astic and interested. Frequently shows enthusiasm. Only mildly 
interested. Very seldom shows enthusiasm. Subject irksome to him. 

8) Ability to Arouse Interest in Students: Interest usually runs 
high. Students frequently show interest. Students occasionally 
show interest. Students seldom interested. Majority inattentive 
most of period. 

4) Organization of Course: Every lesson well organized. Most 
lessons well organized. Some organization but not always clear. 
Very little organization. No organization. 

5) Thinking Demanded of Students: Thinking always necessary. 
Work demands much thinking. Some thinking required. Very little 
thinking required. No thinking required. j 

6) Assignments: Clear and definite. Carefully given but indefinite. 
Definite; often hurriedly given. Rather indefinite and often hur- 
riedly given. Very indefinite; usually hurriedly given. i 

7) Sense of Proportion: Stresses fundamental topics, disregards 
trivial details. Spends most of time on important topics, stresses few 
details. Stresses important topics and details equally. Spends more 

© time on details than on important topics. Often neglects subjec 
for other irrelevant topics. 


- 


mough atatimes. Loud enough but not distinct. Words 
indistinct and not easy to hear. Words very indistinct 
ible to hear. 

of Humor: Has keen sense of humor. Frequently shows 


Kn Besant more Aa is in text. Knowledge limited to 
l. Knowledge apparently deficient at times. Knowledge 
ly deficient. oe @ 
a to Express Ril Words come fn! meaning 


neaning pa not clear. Meaning almost never clear. 
Feeling between Instructor and Students: Complete har- 


to prevail. Students frequently antagonized. Instructor 
tagonize class. 

evalence of Cheating on Examinations: By no one. Only 
liently. Occasionally noted. By many students. By most 


SP confidence: Sure of himself; meets difficulties with poise. 
ver disconcerted. Fairly self-comffident; occasionally dis- 
ed. Often confused by students. Hesitant, timid, uncertain. 
olerance and Liberality: Invites differences of opinion. Wel- 
differences of opinion. Shows no personal response to oppo- 
Sometimes impatient when students oppose his views. 
ily aroused to temper by opposition. 
nctuality in Meeting and Dismissing Classes: Prompt. 
ate. Occasionally late. Often late. Very often late. 
onal Appearance: Well-groomed; clothes neat, clean, 
i tidy and careful as to appearance. Some- 
careless as to appearance. Usually rather untidy and care- 
pearance. Slovenly; clothes and person untidy. 
ersonal Peculiarities: Free from mannerisms. Free from 
mannerisms. Annoying mannerisms not serious. Annoy- 
erisms often detract. Constantly exhibits annoying man- , 
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19) General Estimate of Instructor as a Teacher: Excellent, 
Good. Fair. Poor. Very poor. ə 
With ratings available for eighteen specific traits as well as for 
a general estimate of the instructor’s effectiveness, it was possible 
to calculate a matrix of intercorrelations and to study through fac- 
tor amalysis the question: What factors are reflected in the over-all 
® ratings of these students? In other words, this study reports an 
attempt to determine students’ concepts of effective teaching from 
their ratings of instructors. 


PROCEDURE 


The raw data for the study were the average ratings of each of 
the fifty-five instructors on each of the nineteen scales. Coefficients 
of internal consistency had been computed using the method sug- 
gested by Horst. (4) The intercorrelations of the nineteen variables 
were calculated by the product-moment method. Four reference 
factors were extracted by the complete centroid method, (8) and 
then rotated to simple structure using the single-plane method of 
rotation. (8) The reference factor matrix, the communalities, and 
the coefficients of internal consistency are given in Table I. The — 
final rotated matrix of factor loadings is given in Table II. Table | 
ITI contains the direction cosines of the final reference vectors, 
and Table IV, their intercorrelations. It can be seen that the final 
reference vectors are almost orthogonal; therefore, it seemed un- 
necessary to compute the direction cosines of the primary refer- 
ence vectors, their intercorrelations, and the projections of the 
variables on the primaries. Interpretations are made directly from 
Table II. The correlations which seem especially significant for 
naming the factors are starred. 


©% 


RESULTS ' 
Factor A appears to be a factor of empathy. There are high 
correlations with (1) ability to arouse interest in students, a 
sense of humor, (3) feeling between instructor and student, (4) tol- 
erance and liberality, and (5) general estimate of instructor 98 4 
teacher. aw 
The factor seems to include more than a tribute to simple frien f 
liness or to the instructor’s tendency to be easy. Here is aian 
ə recognition that good teaching often requires an awareness of ra 
lems learners encounter, a willingness to be patient with stude 
® 
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I—Tuxz Centro FACTOR Loapines 


h! 

—.52 | —.43 22 | 86 -90 
17 | -.37 1 | .44 70 
37 08 18. |) .77 90 

—.42 | —.37 25 | .89 84 
32 | —.31 o7 | at 81 

—.31 23 12 | 38 «86 
2 | —.04 12 | 46 70 a S 
06 .09 |. —.31 40 .86 
32 31 u 55 91 
10 | —.49 1 «36 69 

—.16 | —.09 | -.43 | .62 89 
26 43 4 | 78 92 
a4 | -13 | -11 19 «66 
10 | —.32 | —.25 | .62 76 
16 46 10 58 -86 

—.68 20 10 | 57 96 

—.46 48 | —.16 | 54 92 
13 9 | —.30 | 47 86 
16 03 1s | -39 84 


ed by Horst Formula (Psychometrika March 1949). 


s, an appreciation of the world of the student as well as 
of the scholar. 
tively high value placed on empathy by these students 
ily representative of the situation generally. In the 
dy (7) ‘sympathetic attitude’ was ranked relatively 
difference noted here may result from the difference in 
students asked directly may be hesitant to admit that 
ude is important. However, the difference can also be 
in terms of differences in academic aptitude in the two 
more difficulty students have with class work, the more: 
to value teachers who are helpful and understanding. 
is identified as a factor of organization. There are high 
ns with: (1) preparation for class meetings and (2) Hres 
is} 
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TABLE II.—TaE OBLIQUE FACTOR MATRIX AFTER ROTATION 


Sa Scale Name Als | ¢ a 
1 | Preparation for Class Meetings —.02 | .92*) .02 |—.05 
2 | Teacher’s Interest and Enthusiasm in| .22 | .40 |—.34*| 97 
d Subject 
3 | Ability to Arouse Interest in Students .65*| .17 |—.06 | .53* 
4 | Organization of Course .13 | .89*| .01| .04 
5 | Thinking Demanded of Student .05 | .04 |—.42*| 19 
6 | Assignments (definiteness of) .26 | .31 | .38 |—.08 
@ 
7 | Sense of Proportion .46 | .23 |—.10 | .40 
8 | Enunciation .17 | .15 | .23 | 52% 
9 | Sense of Humor .64*|—.03 | .11] -39 
10 | Scholarship .00 | .40 |—.40* . 1 
11 | Ability to Express Thought —.03 AL .29 | . 55! 
12 | Feeling Between Instructor and Student] .77*| .01 | .25| .40 
13 | Prevalence of Cheating on Examinations} .12| .17 |—.07 1 
14 | Self-Confidence .07 | .43 |—.08 | .61 
15 | Tolerance and Liberality .65*|—.02 | .32| 30 
16 | Punctuality in Meeting and Dismissing | .00| .45| .54*|-.80 
Classes age 
17 | Personal Appearance S y A RRG EA PETS a 
18 | (Lack of) Personal Peculiarities .B81/—.01 | .34] - 
, 
19 | General Estimate of Instructor as a | .63*| .41*| -03| 40 
Teacher 


ization of the course, and a relatively high correlation with the ovel- 
all rating. The positive correlations with teacher’s interest and en- 
thusiasm in subject, scholarship, ability to express thought, self- 
confidence, and punctuality can be accepted as reflections of the 
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[NTERCORRELATIONS OF THE FINAL REFERENCE VECTORS 


B c D 
1.00 
—.02 1.01 à 
—.10 —.03 1.00 e 


people who have well organized courses and who are pre- 
ch class also tend to be enthusiastic about their subject, 
thoughts more clearly, to be more punctual, and to aP- @ 
ore scholarly and self-confident. The absence of correlation 
hinking demanded of student’ appears significant also, The 
sseem to value a particular type of organization, one where 
uctor clearly indicates what is expected of the student and 
s the situation that they can ‘dig in’ without experienc- 
‘agony of suspended judgment? or the chaos of planning an 
t project. These students, then, value instruction which 
zed by carefully organized sequences and well prepared 
ions, 
ic organization of subject matter’ was one of the quali- 
d by the Brooklyn College students also. (7) The students 
ed it second for science and social science teachers, and fifth 
nities teachers. 
© seems to represent the punctual, neat, and ‘normal’ 
ual who is the opposite of the ‘absent-minded professor’. 
are high positive correlations with punctuality and personal 
e. There are negative correlations with teachers interest 
ah ithusiasm in subject, thinking demanded of students, and 
The person with a low score on this factor is a mild 
f the absent-minded professor—too busy to geta needed 
ving late to class occasionally because he has been 
ig Some academic interest, engrossed in his subject, a scholar 
ocrates in the classroom. P 
tor is not related to the over-all rating; students appar- 
ibe instructors through their ratings and the factor ap- 
the analysis, but it is not reflected in their judgment of 
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Factor D appears to be a factor of verbal fluency. There are 
moderately high correlations with: (1) ability» to arouse interest in 
students, (2) enunciation, (3) ability to express thought, (4) self- 
confidence, and (5) lack of personal peculiarities, and with the over- 
all rating. The correlations with enunciation and with ability to 
express thought are somewhat lower than one would expect if the 
factor were general speaking ability, and the correlations with the 
other scales seem about right if the factor is verbal fluency. Even 
the negative correlation with ‘punctuality in meeting and dismis- 
sing classes’ could be more than an unexplained irregularity in the 


® data, since there is a tendency for the verbally fluent to keep talk- 


ing after the bell marking the end of a period has rung. 

The Brooklyn College students had no opportunity to judge 
verbal fluency. They rated ‘good speaking ability’ low (sixth for 
humanities teachers and seventh for social science and science 


teachers), but they placed higher value on ‘ability to explain | 
clearly’ (first for science teachers and fourth for humanities and 


social science teachers). 


DISCUSSION 


The picture one gets of students from this analysis is not one of 
incompetent judges assessing teaching on the basis of superficial 
evidence. Who will argue that understanding of the learner, 
organization of the course, and verbal fluency are insignificant as- 
pects of good teaching? And if the picture of the ideal teacher seems 
unduly restricted, shall we attribute it to the incompetence of stu- 
dents as judges? Certainly not until we have assessed the limita- 
tions of the rating scale and examined more fully the values of fac- 
ulty members themselves, 

Here is an example in point. These students appear at first glance 
to place little value on stimulation to thinking as an aspect of good 
teaching. At least the scale ‘thinking demanded of students’ is but 
slightly related to the general estimate of effectiveness. But m 
scale is somewhat ambiguous. Does it refer to the thinking whic 
follows stimulating contact with the instructor—or does it mean the 
thinking required because the instructor left the student to 8^ 
for himself? Perhaps a more clearly described scale would bring 
different results. 

On the other hand, the data from the Brooklyn study suggest 
that for certain courses students donot valuestimulation to thought. 


D 


Students’ Concepis of Effective Teaching 285 


venteen per cent of the students listed it as one of the three 

1 desirable qualities for science teachers. Is it possible that 

{ ‘themselves have emphasized other aspects of the teaching 

i so much that students have failed to appreciate the values 
mulation to thought? 

_ Byrevising the scale and repeating the analysis one might clarify 

the interpretation. Three possible types of revision seem indicated. 
i e of the categories in the present scale have low reliability 

y be discarded without significant loss of information— 

ws interest and enthusiasm in subject, scholarship, preva- 

of cheating on examinations). If evidence relative to these iS 9 
other means of collecting information should prove more 
liable. (2) Factors A and D are definitive enough to be included 
separate scales in the rating sheet as a check on the accuracy of 
‘interpretation given here. (Factor B appears to be clearly de- 

ale 1 and Scale 4.) (3) One or more scales related to an 
hecated stimulation to thinking factor might be added in 

of the ambiguous Scale 5 as a means of further clarifying the 

of students relative to the importance of this aspect of 


‘The factor analysis approach might also be used to test for 
res in student values as a result of faculty effort. One could an- 
e ratings previous to such effort and then repeat the analysis 
ing a period of special emphasis on helping students under- 
stand the goals of instruction. If, for example, a preliminary an- 
pe that students placed little value on stimulation to 

as an aspect of good teaching, and if a group of in- 
_Situctors directed their efforts toward helping students understand 
And accept stimulation to thinking as a legitimate goal of instruc- 
tion, Asubsequent analysis should result in the appearance of a 
factor of stimulation to thinking with significant loadings for the 
h rating, 
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OF SLANTED TEXT UPON THE 
_ READABILITY OF PRINT’ 


MILES A, TINKER 


University of Minnesota ð 


ted copy is lying flat, the accomodation changes re- 
or clear seeing as the eyes shift from fixation to fixation 
line of print are relatively slight and do not appear to inter- 
the mechanics of reading. Furthermore, on flat copy the 
ms are clearly visible. In some books the printed material ® 9 
tlie flat, ie., as in many thick texts, dictionaries, and 
ournals. In these books there tends to be considerable 
of the page near the inner or gutter margin where the 
bound. To what degree does this condition interfere with 
nd rapid perception in reading? Apparently there are two 
for interference to arise. First, the eye must accommodate 
ly with each change in focal distance that occurs with 
eye movement to a new fixation along the curved surface. 
iccommodation changes are relatively slow, speed of reading 
‘be retarded when reading is done under such conditions. 
ondly, when part of a curved line of print is farther from the 
pear other parts, the letter and word forms along the line are 
torted, Although this distortion does not become important 
the printed page is nearly flat, it may seriously interfere with 
ity and hence with clear and rapid perception when there is 
able curvature of the printed page. 
specific problem in this study is to investigate changes that 
‘in speed of reading and in visibility of words when the read- 
opy is slanted away from the eyes at different angles. The 
should provide some evidence concerning the effects of ac- 
tion changes and distortions of the visual images of words 
and speed of perception in reading curved lines of print 
books. Speed of perception will be measured in terms of 
reading. The effect from distortions of word forms on ease 
g will be measured in terms of visibility of selected 


research was supported by a grant from the Graduate School, 


ity of Minnesota. a 
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MATERIALS AND PROCEDURE 


The reading material consisted of Forms I and II of Tinker’s 
Speed of Reading Tes. In each form there are four hundred and 
fifty items of thirty words each. As a check on comprehension, the 
reader crosses out the one word that spoils the meaning of each 
iter®. Since responses are more than ninety-nine per cent correct, 
the test virtually measures speed of reading with comprehension 
constant. The two forms, which are approximately equivalent, were 
printed in ten-point Excelsior type in a twenty pica line width with 
two-point leading on eggshell paper stock. 


° © For the visibility measurements, thirty five-letter words were cut | 


from the test and pasted on white cards, one word per®card. All 
words used are within the five hundred most familiar words in the 
Thorndike and Lorge list® and consequently have a high index of 
familiarity. 

The experiment was conducted in a laboratory which provided 


twenty-five foot-candles of indirect, well-diffused incandescent ira 


lumination. The subjects were one hundred and eighty university 
sophomores. They were divided into three groups; one control and 
two experimental groups. The subjects were tested individually, 
and the order of testing was systematically varied. 

A reading stand with the reading surface at forty-five degrees 
to the table was employed for holding the flat copy that was to be 
read in the normal manner (Test Group 1, Control). Two triangu- 
lar copy holders were devised to hold the reading text with the lines 
at a slant; part of the line near subject, part farther away. The 
triangular copy holder was placed upon the regular reading stand 
and so arranged that the center of the slanted lines to be read wer? 
the same distance from the eyes as the copy read upon the regular 
reading stand. One triangular copy holder slanted away from the 
flat reading plane (and consequently from the eyes) at forty-five 
degrees (Experimental Group 2) and the other at sixty degrees 
(Experimental Group 8). re 

On arrival at the laboratory a subject was adapted for five oe 
utes to the illumination employed while taking the practice ex¢ 


2 Not yet available for general distribution. ds. 
3 E. L, Thorndike and I. Lorge. The Teacher’s Work Book of IT 
New York: Bureau of Publications, Teachers College, Columbia Unt 
1944. 
2 
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ing test. In Test Group 1, the control group, sub- 
m I and Form II of the reading test on the flat surface 
r reading stand, i.e., normal reading situation. In Ex- 
Groups (2 and 3), Form I of the test was read in the 
as with the control group. For Group 2, Form II of 
read slanted at forty-five degrees, and for Group 3; 
read slanted at sixty degrees. The time limit was five o 
each test form in all testing. The subject responded by 
loud the word that spoiled the meaning in each item. 
speed of reading tests were finished, visibility measure- 
made with the Luckiesh-Moss Visibility Meter. These, o 
nents were made under the same conditions as the speed 
measurements, i.e., on a flat surface, slanted at forty- 
d at sixty degrees. After six practice words, the visibility of 
ords was measured in each position. The order of presenta- 


j systematically varied. 
; l n 
RESULTS AND DISCUSSION 


basic data for the speed of reading measurements are given 
"able 1. The mean scores in column 4 represent the number of 
of thirty words each read in five-minute periods. In the con- 
toup there wasa statistically insignificant gain of .60 item on 
The means in the experimental groups were ‘corrected’ 
amount, The reliabilities of the scores (Form I vs. Form 1) 
in column 8. They range from .92 to .96. 
p 2 are the results for reading flat copy in comparison 


text at forty-five degrees from the flat surface. Material at 
slower. The criti- 


n Group 3 are the results for reading flat copy VS. COPY at a 
e slant. Here the slanted copy was read 16.4 per cent, 
The difference is highly significant. ; 
for the visibility measurements are given in Table 2. The 


are in terms of scale readings on the visibility meter. The 
was high. 


the three conditions were 


ities for responses within each of 
.93; forty- 


omputed. These (uncorrected) are: flat copy, T = 

slant, r = .88; sixty degree slant,r= 91. 

tion of the data in Table 2 shows that the visibility of 
e 


A 
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words is reduced markedly when the words are at an angle in com- 
parison with flat copy. Thus in column 5, there is a 31.3 per cent 
loss when the slant is forty-five degrees, and a 48.4 per cent logs 
when the slant is sixty degrees. The sixty-degree slant is 24.9 per 
cent worse than the forty-five-degree in reducing visibility, All 
these differences are highly significant as shown by the correspond- 
ing critical ratios in column 7. 

The results in Tables 1 and 2 show that readability of print is 


TABLE 1.—Speep or READING Text on FLAT AND ON SLANTED SURFACES 


Dee 
ifterence® 
Group Test | Position of Copy | Mean | Sky x _| D/SEaut, 


Paragraphs én 


a) Q) (3) (4) (5) (6) (7) (8) (9) 


1 I Flat 52.50 | 1.42 E 
I | Flat Eaa o o] 0-0) -28 

2 | I |+ Flat 52.95 | 1.58] _ B 
I 45° Slant | 50.53 | 1 58 | 7302| 5-7 | -95 

3 I Flat 52.17 | 1.65 | _ 2 | 19077 
Ir 60° Slant | 44.93 | 1.37 | —8-54 | 16-4 9 


* The differences in column 6 are ‘corrected’ by the amount of the dif- 
ference between the mean scores of Form I and Form II of Test Group! 
which serves as a control group. The ‘correction’ amounts to 0.60 items for 
each test group comparison and is subtracted from the Form II score. 


TABLE 2.—VISIBILITY MEASUREMENTS OF WORDS ON FLAT AND ON SLANTED 
SURFACES 


AYES IS a fey) ATR Sor aE A 
Comparison Mean | SEx | ig. Eg Oat] r | D/SEutth 
eae 
m) 
W o joj] o Cnr cel jee! 
Flat vs. 45° Slant 4.44 .25' 9.93 
—31.3 91 
3.05 13 1.39 31 
Flat vs. 60° Slant 4.44 25 12.13 
—48.4 81 
2.29 | 10 | 2-1 


45° Slant vs. 60° Slant | 3.05 -13 


2.29 | .10 i AE R 
A Ste Rate Sel ee cleo EEA D 
ji ; 
2 "i 
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to a marked degree when the copy is held so 

the lings of print are at different distances from the 

true for both speed of reading and for visibility of the 

Although accommodation changes occur during the 

it is difficult to estimate the relative importance of 

this experiment. The effects of the slanted print on 

so great that they would seem to be dominant în re- 

speed of reading. Apparently, a modified design will be 

to get at the accommodation effects. 

proximation to the practical situation would be to 

ing on a curved surface vs. a flat surface. This study is, 
way. 

sults obtained in this study suggest that the marked 

lines of print near the inner margin of large books and 

adversely affect readability. If much wider inner mar- 

used, the situation would be much improved. 


SUMMARY 


his experiment was designed to investigate the effects upon 
‘reading and upon visibility of word forms when the text is 
that parts of a line are at varying distances oat 
terial slanted at forty-five and at sixty degrees was com- 
flat copy. There were one hundred and eighty subjects. 
of reading was retarded 5.7 per cent by the forty-five- 
, and 16.4 per cent by the sixty-degree slant. 
ity of word forms was reduced 31.3 per cent by the 
degree slant, and 48.4 per cent by the sixty-degree slant. 
@ rôle of accommodation changes cannot be determined 
e results, 
is suggested that a major portion of the retardation in read- 
d is due to the reduced visibility of the words. 
results have a bearing upon printing and binding of 
and bound journals where there tends to be consider- 
ature of the lines of print near the inner margin. To avoid 
ious effects of this curvature upon readability, it is sug- 
at large books and journals should be printed with a much 
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DELINEATING THE AREAS OF EDUCATIONAL 
AND CLINICAL PSYCHOLOGY 


RUSSELL N. CASSEL 


" Major, USAF, Scott Air Force Base, Illinois 


The training and duties of the educational and clinical psycholo- 
gists are two separately organized and distinct disciplines, but they 
are very closely related, and there is considerable overlapping of 

_their respective areas. The primary objective of the educational 
psychologist! is to maintain the mental good health of individuals 
through the psychoeducational or mental hygiene clinic and com- 
munity mental hygiene programs (2, 15); while the clinical psy- 
chologist deals primarily with mentally-ill persons emphasizing 
therapy procedures and related practices through the psychiatrie 
or psychological clinic. The educational psychologist deals pri- 
marily with teachers and religious ministers through the public 
schools, churches, and other social and industrial organizations; while 
the clinical psychologist deals with the psychiatrist, medical doctor, 
and other members of the psychiatric team through the hospitals, 
mental institutions, penal institutions, and other agencies charged 
with the care and rehabilitation of mentally-ill persons (1, 5, 14): 

As the increased complexity of the continuous specialized de- 
velopment of these two psychological disciplines emerges, the need 
for a more precise delineation of their respective areas of activity 
becomes more evident. This article defines the boundaries of these 
two important disciplines based on functional objectives as they 
are visualized by the author (5, 6, 7). y 

‘Happiness’ of the human being, in its broadest sense, © w 
ultimate goal for both of these disciplines, and good mental healt 
is a necessary prerequisite for this happiness. The psychological 
structure for human happiness consists of an on-going process R 
goal-setting and goal-striving in which ego-involyement is aes 
(7, 13). In addition, optimum happiness of the individual mus 
insure the greatest space for freedom of movement of the individu! 


er refers 


1 The term ‘educational psychologist? as it is used in this pap sey 


to the practice rather than the teaching; the practitioner rather t! 


professor. 
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predicated on a minimum of infringement of rights of 
[t thrives beat in a social climate effervescing with self- and 
posed barriers to which all individuals are vulnerable 


he educational and clinical psychologists maintain or regain 

ore the mental good health of individuals, they bothgwork 

ly with the personality. The first working primarily with © 
‘individuals, and the latter with atypical and abnormal ones. 

ng psychological dynamics of a traumatic kind seriously 
the mental health of the individual; the after-effect of these 
mal experiences leave mental and emotional scars that impaite @ 
ntal health of the individual equally as much as the loss of 

an arm would impair the physical functioning of the organ- 
just as the prepared vaccines of preventive medicine reduce 

te and consequence of physical illness so, effective mental 

e programs may prevent and control the spread of mental 


tistical data indicate that the precipitation of mental ill- 
crease rapidly after the age of sixty, and even greater after 
enty. In 1900 only a small percentage of our population lived 
ad the age of sixty—about four per cent; today that figure has 
sed to more than seven per cent, and it is anticipated that in 
next ten years more than 11 per cent of the population will live 
d age 60. It may be that the general deterioration of organic 
etioning with age makes the individual more vulnerable to men- 
fal illness; or it is possible that scientific progress in the prevention 
‘treatment of mental illness has not progressed as rapidly as for 
ical illnesses (2). 
n 1946 by passing the Federal Mental Health Act, the United 
has accepted a large portion of the responsibility for the 
ance of good mental health for its citizens just as it had pre- 
y with venereal disease, tuberculosis, small-pox, cancer, an 
physical diseases. The goal of this act is to insure every Amer- 
tizen the opportunity for good mental health by providing 
al support for the development of mental hygiene preventive 
ment programs in our local communities. Increased emphasis — 
een placed on preventive procedures that will insure good 
health for all individuals, and to provide the necessary , 
and research to implement this program. The incre: i 
the educational psychologist in the preventive aspectg of this 
C 


ON 


b> 


294 The Journal of Educational Psychology 


program, has resulted in an immediate need to delineate between 
the areas of these two disciplines. © 


I, THE EDUCATIONAL PSYCHOLOGIST IN THE PSYCHO-EDUCATIONAL 
CLINIC 


Educational psychology as a science stems from the work of Pest- 
alozzi, Herbart, and Froebel in their attempts to psychologize edu- 
cation by revising the methods for the presentation of subject mat- 
ter during the late 19th Century. The second major movement in 
its development was the organization of a course for teachers and 


O prospective teachers in educational psychology by E. L. Thorndike, 


Charles Judd, and Lewis Terman about the same time and around 
the turn of the 20th Century. Since about 1900, educational psy- 
chology has been concerned with three major areas: (1) research 
and the analysis of educational problems, (2) the development and 
organization of principles, approaches, and theories of learning, and 
(8) the guidance and counseling of individuals educationally, voca- 
tionally, and personally. From this last area, and stemming directly 
from the impetus and need furnished by the Mental Health Act, 
has emerged the concept of educational psychology as described 
herein. 

The Psycho-educational or Mental Hygiene Clinic.—In accord- 
ance with the provisions of the Mental Health Act, the mental hy- 
giene clinics sponsored by federal money emphasize preventive 
Measures designed to maintain the mental good health of all indi- 
viduals. An outline of the principal functions imperative to such an 
an effective preventive program, and in which the educational psy- 
chologist becomes the principal figure, is as follows: 


I. Educational and Vocational Guidance 

A) Educational Planning and Guidance: 

1) Junior high school level, 

2) Senior high school level, 

3) College level, 

4) Graduate school level, 

5) Adults. 
B) Vocational and Occupational Planning and Guidance: 

1) Integrating the educational and vocational plans. 

2) Integrating the ‘life-plans’ of the individual. 

5 
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ning for late adolescent and post adolescent individ- 


3 


g, Psychodiagnoses, and Therapy for the Typical 
Personality Class of Individuals 
odiagnoses and Prognoses of Mental Tllness, and the Re- 
of Severe Clinical Pictures. 
dual and Group Psychotherapy. 
rity, Parental, and Personal Aid Relative to Child De- 
ney and Discipline Problems. 
in Dealing With Inter-personal Conflict Problems: ee 
Child-parent conflict. 
ployer-employee conflict. 
‘eacher-pupil conflict. 

conflict. 
conflict. 
8) Marital conflict, and sexual adjustment. 


“Mental Hygiene Programs for the Individual or the 
Community for Maintaining the Mental Good Health 
of the Typical Personality Class 


Prospective groom. i 
3) Prospective ‘in-laws’ and parents of bride and groom. 


Pre-family Planning and Birth Control. i 
atal Parental and Family Adjustment Program. 
D) Infant Care and Child Development. ' 
‘Child Guidance, Discipline, and Mental Hygiene Care. 
The Puberty Cycle and Adolescent Development. 
: Life Planning Programs. R 

Marriage and Sex Adjustment for Adults. 
Middle-age or Menopause Adjustment Program. 

Age or Senile Adjustment Program. 

‘Education and Orientation for the Child: 


Post-puberty cycle. 3 
ding Individuals to Reach Higher Levels of Adjustment. 
; i e 


c 
a 


D 


296 The Journal of Educational Psychology 


IV. Professional and Consultant Services 


A) Conduct Research Related to the Prevention of Mental Illness 
and the Maintenance of Mental Good Health. 
B) Conduct Research Related to Learning and the Learning Proc- 


ess. 

C) Pfofessional Surveys on Matters Pertaining to Learning. 

D) Lectures and Consultant Services Relative to Educational Psy- 
chology. 

E) Organization and Supervision of the Training for the Assist- 
ants and Interns in the Field of Educational Psychology. 


Il, THE CLINICAL PSYCHOLOGIST IN THE PSYCHIATRIC CLINIC 


The discipline of clinical psychology has emerged emulating and 
keeping parallel with the development of clinical psychiatry during 
the early portion of World War II. In the most progressive clinics, 
the trend is to function in conjunction with other members of the 
psychiatric team. The team is usually made up of the following 
members: (1) neuro- or clinical psychiatrist, (2) a clinical psycholo- 
gist, and (3) a social case worker, In general, the psychiatrist heads 
the team and is responsible for codrdinating and integrating the 
services of the other members, and for determining the diagnoses, 
prognoses, and final disposition of the case. The clinical psycholo- 
gist performs a function for the psychiatrist similar to that of the 
medical laboratory for the internal medical practitioner in that he 
administers and interprets psychological tests which are designed 
to reveal the presence of incipient and latent conditions of psy- 
choses, and assists the psychiatrist in other psychological proce- 
dures designed for the treatment and prevention of mental illness. 
The social case worker is primarily concerned with working-up 
and writing the case history of the individual, but, also, assists the 
psychiatrist in psychological procedures designed for the treatment 
and prevention of mental illness, 

Motivated by desire to retain their patients, many medical 
doctors have established the practice of referring mentally ill per- 
Sons to a psychological clinic headed by a clinical psychologist. 
Under this set-up, the clinical psychologist assumes full respon- 
sibility for all psychological aspects, and continues to work with the 
Patient in conjunction with the medical doctor making the referral, 
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ins the responsibility for the physical aspects of the care 
tment. i 
he Psychiatric or Psychological Clinic: —The clinical psycholo- 
deals primarily with those individuals who are diagnosed as 
entally ill, and the emphasis is placed on meliorative or 
we measures which are designed to regain the good mental 
th. Because of the complex nature of mental health and its 
saad with the physical health of the individual, a basic 
fundamental in the meliorative approach is to investigate the pres- 
of physical aspects either before or concomitant with the psy- 
chological approach. In line with this kind of policy, the following@ © 
‘areas of concern constitute the primary functions of the clinical 
chologist:: 
Psychodiagnostic Collaboration with the Psychiatric Team or 
a Medical Doctor for Mentally Ill Persons from the Atypical and 
Abnormal Personality Classes: 
__A) Problems of incipient or latent psychoses. 
© B) Problems of intra-cranial organic pathology. 
©) Severe clinical pictures of psychoneuroses. 
_D) Deterioration or impairment of mental functioning. 
E) Problems of sexual aberrations. 
_ F) Penal and severe delinquent problem cases. 
Œ) Inter-personal dynamic problems of a severe type. 
Il) Meliorative Collaboration with the Psychiatric Team or & 
“Medical Doctor for Individuals from the Atypical or Abnormal 
_ Personality Classes: 
A) Individual or group therapy. 
B) Pre- and post-neurosurgery evaluation and planning. 
_ ©) Rehabilitation and reéducation of neurosurgery patients. 
~ D) Psychoanalyses (where the clinician is properly qualified). 
E) Penal and delinquency rehabilitation. 
F) Plan, organize, and direct an occupational therapy pro- 


‘et 

ih gram. 

IM) Organization and Direction of Speech Clinic: 

____ A) Psychodiagnostic collaboration and treatment of aphasic, 
j apraxia, agnosia, and other related problem cases. 

_ B) Collaboration in the diagnoses and treatment of other 

problems of speech disfunctioning and defects. 
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C) Collaboration in the planning and correction of speech de- 
fects. 
IV) Professional and Consultant Services: 
A) Conducts research related to the diagnoses and treatment 
of mental illness, and its prevention. 
B) Lectures and consultant services relative to clinical psy- 


y chology. 
C. Organization and supervision of the training for assistants 
and interns in the field of clinical psychology. 
l 
9 Ill, TRAINING 4 


-Just as the child of today becomes the parent of tomorrow; so 
the typical individual of one day becomes the atypical and ab- 
normal individual of another day. The training of the educational 
psychologist emphasizes preventive and prophylactic procedures 
and functions; while that for the clinical psychologist, emphasizes 
curative or meliorative ones. There is certainly no province or as- 
pect in the area of psychological procedures and techniques that 
is solely the concern of either of these separate disciplines. In the 
present program of training, as indicated by the present need, 
there appears to be a definite trend in the training of educational 
psychologists to include more and more data relative to projective 
techniques and their use in the clinic, and the provision for clinical 
internships. For clinical psychologists, the training tends to include 
more and more data relative to the psychology of learning, and the 
réle of learning in effective mental hygiene preventive programs | 


SUMMARY 


‘This article is concerned with delineating’ the areas of the edu- 
cational and clinical psychologist as two separate disciplines, and 
describes the functions of each as they are visualized and practiced 
by the author. The Federal Mental Health Act of 1946 has tended 
to emphasize the rôle of the educational psychologist in the preven- 
tion of mental illness among individuals. Thus, the rôle of the edu- | 
cational psychologist tends to center around the preventive or pro- 
phylactic function relative to mental illness; while the clinical 
psychologist is concerned primarily with curative or meliorative 
ones. 
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A COMPARISON OF THE VALUE SYSTEMS 
OF SOUTHERN NEGRO AND NORTHERN 
WHITE YOUTH 


JULIUS H. HUGHES and GEORGE G. THOMPSON 


Syracuse University 


The purpose of this study was to compare the values, as meas- 
ured by value-analysis, of a group of Negro boys and girls living in 


œ the South and attending a racially segregated school with those of 


an equal number of white adolescents living in the North and at- 
tending a non-segregated school. $ 


, EXPERIMENTAL PROCEDURE 


The technique used in the present study was introduced by 
White who made a value-analysis of a Negro novelist’s autobiog- 
-raphy (1). Value-analysis as developed by White consists of three 
steps: 

1) Put in the margin a symbol corresponding to each goal and 
each value-judgment that is explicitly stated in the material, or 
clearly implied by it. 

2) Tabulate these symbols. 

3) Interpret each numerical result in the light of the picture as a 
whole, with special attention to the person’s possible reasons for 
conscious concealment or unconscious self-deception (2). 

The values which were thus analyzed are listed in Table I. The 
list was derived from White’s fifty basic categories of value-analy- 
sis (2). 

The writings that were analyzed for their value judgments were 
themes the experimental subjects were asked to write. The title 
given them was “Things I Would Like to Change in My Way of 
living.” These essays were written in class with instructions to the 
teachers to limit the writing time to exactly thirty minutes. The 
teachers were asked to make no further interpretation of the subject 
on which the students were requested to write. The boys and girls 
were free to define the subject in any manner that they wished. - 

Tt was anticipated that the students might avoid using certain 
words—which might prove to be key ones—because they were in 
doubsabout the spelling of the words. The teachers were, therefore, 
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: to point out to the pupils that they should spell a word the 
a they could whenever they were in doubt about its spelling. 
Bach student recorded his name, age, sex, father’s occupation, 
mother’s occupation, and length of time he had lived in the com- 
munity. The time for recording this information was not included 
the time allowed for the writing of the themes. No student,who 
had resided in his present city for less than one year was included œ 
the study. The teachers in the non-segregated school system 
identifed for the investigators those essays which had been written 
_ by white students. 
Twelve hundred and sixty-nine themes were obtained. Theses @ 
were grouped according to locale, sex, race, and age. Since there 
"were fewer Negro boys in the non-segregated schools than in any 
rt other group, the data were examined to determine what adjacent 
“chronological ages had the largest number of cases. These were 
found to be the fifteen- and sixteen-year-olds. There were twenty- 
"to subjects in the smallest race-locale-sex subgroup of this age 
" dassification. All of the remaining essays written by fifteen- and 
_ sixteen-year-olds were grouped according to race, sex, and city, and 
- each of these themes wasassigned a number. Using a table of random 
_ numbers, twenty-two themes were selected from each group. This 
‘gave a total of eighty-eight essays. These were analyzed by the 
procedures recommended by White in his version of value-analysis. 
Í Tn order to determine scorer reliability, nine of the themes which 
_ Were not selected for the final analysis were analyzed independently 
i bya third-year graduate student in psychology. These inde- 
-pendent judgments were compared with those of the analyst who 
‘a scored all of the eighty-eight themes in the experiment proper. A 
l disagreement was recorded when one analyst recorded a symbol 
Í ed the other did not and when different symbols meaning dif- 
| forent values were used. The following sentence taken from one 
of the themes marked with symbols employed in the study provides 
an example: 


: 


Analyst ‘A’ Analyst ‘B’ 
(Se) Everyone today is living in constant fear of a (‘Se) 
(8) Third World War which probably will kill nine- 

tenths of the people in the world. 


The independent judges agreed that the proper interpretation of 
this response is “there is no security.” Analyst ‘A’, howevey, saw 
i © 


i 
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an additional value of ‘safety’ and indicated that the statement 
suggested ‘no safety’. These were recorded, therefore, as one agree- 
ment and one disagreement. The obtained percentages of agree- 
ment for the nine themes were as follows: 76, 81, 67, 69, 66, 78, 80, 
65, 66. Approximately three months after the initial scoring of 
these nine themes, the principal analyst scored them again inde- | 
® pendently of his first scoring. The obtained percentages of agree- 
ment were as follows: 68, 91, 78, 76, 83, 84, 88, 89, 86. These are in 
keeping with the percentages of agreement reported by White. 


© » RESULTS 


y 

a 
Comparisons were made between the Negro boys and girls of the 
southern community and the northern white adolescents in terms 
of the frequency of the various values reflected in their themes. A 
cursory inspection of Table I reveals that there are a number of 
values which received a similar amount of emphasis from the two 
groups. Both groups gave substantial attention to knowledge: 8.2 
per cent of the total emphasis units for the whites as compared with 
8.0 per cent for the Negroes. The white students demonstrated an 
interest in money, high wages, and low prices as is evidenced by 
5.6 per cent units on the economic value. So did the Negroes, 5.6 per 
cent emphasis units. A common interest in success and superiority 
is suggested by 6.6 per cent of the total emphasis units in the white 
youth’s responses reflecting achievement, and 7.4 per cent empha- 
sis units of the same type in the Negro youth’s responses. That 
both the Negro and the white subjects were interested in having 
friends and being liked is suggested by the fact that they gave 3.4 
per cent and 4.6 per cent, respectively, of their attention to the 
friendship value. 

The data were tested for statistical significance of the differences 
in the proportion of emphasis given the values by the two racial 
groups. Where the theoretical frequencies were less than five, prob- | 
ability was determined by the direct method. $ 

The Negro group gave considerably more emphasis to justice 
than did the white boys and girls (p<.0001). This implies that the 
Negro adolescents were far more concerned with equality of oppor- 


1 The formula used to determine these probabilities was: 
p = ( (& + d)I@ + o)l(b + a)o + d)! mm) 
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fe ‘Taste I.—Emrnasis on DIFFERENT VALUES 


n Whites from Non-Segregated Community; Southern Negroes 
Nn from Segregated Community)! 


Percentage of All Percentage of All 
Emphasis—Units Values Emphas; nits 
a! 


White | Negro White | Megro 


Adjustment 
Aggression 
Beauty 
Recognition 
Rest 
Self-regard 
Cleanliness 
Food 
Pleasant personality 
Appearance 
Group Unity 
Culture 
Dominance 
Sex-love 
Manners 
Likeness 
Obedience 
Truthfulness 
Creative _ self-expres-| 

sion 
Excitement 
Sex 
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iy-nine for the Negro youth. 
cally significant difference with a p value of at least 01. The 
of obtaining six or more significant differences by chance 


and fairness than were the white subjects. The attention of 
o subjects was directed to the group unity value with a 
quency than was that of the white group (p<.0001). 
hite subjects were significantly more concerned with being 
ind happy than were the Negro students. This is indicated 
erence in the emphasis given to the happiness value 
02). Closely related to this difference is is the amount of 
the two groups gave the security value. The, white 
oi 
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boys and girls were more concerned with freedom from worry than 
was the Negro group (p = .0098). ’ 

The kind of companionship that is given by one’s family (family- 

_ love) was given a significantly larger percentage of emphasis by the 
white than by the Negro adolescents (p<.0001). At the same time, 
however, the white subjects mentioned the independence value 

® more than did the Negroes (p = .0098).? 

The above analysis is in terms of the frequency with which the 
two groups mentioned the values. Probably of more significance is 
how the subjects treated the values. Value-analysis provides a 

© means of dividing the values into four categories: frustration and : 
disapproval, satisfaction, desires, and means-end relationships. 

The data were examined for possible differences between the two 
racial groups in the satisfaction category. A significantly larger 
percentage (p = .0044) of the white subjects indicated satisfaction 
in terms of friendships. The white youth also enjoyed greater hap- 
piness (p <.0001) and family-love (p = .0098). The probability of 
obtaining three or more significant differences by chance alone is 
.001. The following statements are illustrative of how the white 
boys and girls treated the above values: 

I am quite satisfied with my way of living. I have a very 
nice home, very good parents a good school in which to goand 
teachers who are swell. 

Our family is an average family, we enjoy living and are 
very happy. 

I am known to be very cheerful and patient among my 
friends, i 

For no value did a significantly larger percentage of the Negro 4 
pupils experience satisfaction. 

An analysis of the frustration area reveals that a significantly 
larger percentage of the Negro boys’ and girls’ emphasis-units dealt -| 


2 The possibility was considered that the greater emphasis on the values 
by one group was the result of a few individuals each mentioning a particular 
value many times. The number of persons mentioning the six values Te 
ported as significant was compared, and the following p values were ob- 
tained: happiness, <.0008; security, <.0110; family-love, <.0003; inde- 
pendence, <.1292; justice, <.0010; group-unity, <.0005. Thus it can be 
seen that the differences between groups are not dependent on multiple 


responses by a few individuals, sa 
ð 
D 
a 
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justice (p <.0001). For instance: 

are many other things I could change but I think this 
more important. Our school system, I think we are just 
as the next person. There should be no segregation in 
ool, and so it is we should have the same rights. 

ys, every where we go whites are always first. » 

ald like to change the way colored people have to ride 


be able to get into college even tho I am going to take 2 
s of Spanish. 

ger vacations on holidays. Less school books published. 
would change my attitude toward studying. I would get 
n to some hard studying instead of watching television all 
ght long until it is to late to study for the test the next day. 
ddition, the white group emphasized a lack of security and 
w safety more than did the Negro boys and girls. For 


ars are the largest ways in which nations have tried to 

what they want. Ever since man came on earth there 

e been wars in which a few have profited and the majority 
e lost. 

There is one very important change and that is, that I 

would like to have grown up in a world which had not known 


They (old people) are put in homes, and other institutions 
they have nothing to do but sit and walk around un- 
d, unloved, and useless. 
bability of obtaining five or more significant differences by 
alone is .001. 
the two areas, frustration and satisfaction, are considered 
er, it appears that the southern Negro youth find the en- 
t more conflict-producing than do the northern white boys 
This is suggested by a frustration-satisfaction ratio of 
° 
© 


e: pa 
have failed algerbra and dropped Latin. I am afraid I will 
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3.00 for the Negro pupils as compared with one of 1.19 for the 

white students. It seems that the Negro adotescents perceive segre- 

gation as a condition which produces inequalities, for most of their 

references to a lack of justice contended that it is ‘unfair’ to have 

to sit in a certain section on a bus, to have separate schools, ete, 

Justice, moreover, accounted for approximately one-third of the 
® — Negro students’ emphasis-units in the frustration area, 

Of the forty-one values mentioned in the desires of the two 
groups, there was a statistically significant difference (at p<.02) . 
for four of the desires. The probability of obtaining four or more 

® w significant differences by chance alone is .008. 3 

Both groups emphasized the desire for knowledge more than 
they did any other value. The Negroes tended to perceive knowl- 
edge as a means of self-enhancement and to use that value to attack 

‘unequal educational opportunities’. The following statements were | 
taken from the themes written by Negro students: | 

Pd like to have it said when I graduate from this school that, _ 
that is one of our best. 

...if there should come a time that we can go to school 
with the other races we should be given the same rights and 
seh should be some Negro teachers, and with the same pril- 
adge. 

I would study a little harder than I have so I won’t have to 
cram everything in my head at one time and stay up half the 
night trying to get it. 

Another thing T would like to change and that is equal right of 
an education. | 

I would endeavor to study more, There are some students I ‘ 
idolize for their brilliant work in school. 

I plan to continue my education beyond high school and 
even beyond my B.S. degree. 

The white boys and girls, too, appeared to be interested in knowl- 
edge as a means of extending the ego. There is some indication that 
they also feel that the school is not just. For example: 

First I think that the periods should be only a half hour 
long and after each period we should be allowed an half hour 
study hall in which to do our homework. 

‘I would like to get higher marks and be a better student, « 
most of all I would like to get a scholarship for college—if it 
Were possible, 


y 
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+ fairly descent marks but I just can’t seem to do 

every night for more than an hour at least. 

like to go to college but I am not smart enough. 

gave substantial emphasis to the economic value (a 

for the white pupils; four, for the Negro pupils). 

s gave significantly more emphasis to the achigve- 
oreover, some of them appeared to view achieve- © 

st a partial solution to their problems. Examples fol- 


think boys who are going to high school or college should 
drafted. They should be allowed to finish. Because the u $ 
rmy force want men who are educated. n 
T would perform all my duties to the best of my ability, and 
ald study so that I would be a better student than which 


if I could ever learn to type. 

‘ould first of all drop all my subjects except cosmetology 
complete that course. After finishing cosmetology and 
g my licience I would continue and finish school. 

, I will strive and study hard to finish high school and 
enter college in the fall of ’52, and to repay my parents 
they have done for me, succeed in my ambition to be a 
in the United States Armed Forces. i 
legro group also gave more emphasis to the following val- 
ice, work, and group unity. These southern Negro ado- 
‘appeared to see a substantial relationship between these 

i, for they tended to treat each in terms of providing equality 
ortunity. Some examples: 

I would get a few more theaters, 
schools oh well mixed every 
n’t matter what color you are it is what one is 
so I would do away with this racial problem. 
„as an American, would like to be treated as an American. 
the Constitution there is a law which says «All men are 


and I would also have 
thing because it really 
on the in- 


God to let it be 


cc 
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applied for the job and were qualified, just because I am a 
Negro and he is white, the white persow would get the job, 

I would like for the relationship between the Negroes and 
whites to change. I would like to get a job on week end days 
like the white boys. Maybe if I had a chance to mix with the 

White boys and girls now it would be a great help. 

I would like to see the time when all men united together 
and help make the United States be a larger and better place 
to live. 

Negros Helpt to build the United States and look at the 
privologes they have they must go to thear own theathers yet 
the foreignor goes to the white and colord shows. 

Another change would be, is getting better jobs for young 
and old Negro people. 

The data were not analyzed for differences in means-end rela- 
tionships because both groups tended to view the values as ends 
in themselves with little expressions of specific means to reach de- 
sired goals. This may have been a function of the topic on which 
the boys and girls were asked to write. 


CONCLUSIONS AND DISCUSSION 


This analysis of the values of the white adolescents living in the 
North and attending a non-segregated school and those of a group 
of Negro boys and girls living in the South and attending a racially 
Segregated school reveals that there is a tendency for both groups 
to place a similar amount of emphasis on the great majority of the 
values. The most outstanding differences in their values are in the 
following areas: justice, family-love, happiness, and group unity. 

The emphasis upon group unity suggests that the southern Negro 
adolescents have a strong drive to be accepted as an essential part 
of the community. The Negro group tended to identify strongly 
With their own ethnic group and appeared tobe quite egotistic. Most 
of their references to injustice, for example, continually demanded 
‘equal rights’ for Negroes, Many of the statements, in fact, implied 
that the Negro is the only group that is not being treated fairly. 
Identification with the Negro race may have been in this instance 
a convenient and socially acceptable means of expressing a desire 
to secure equal rights for the individual. These southern Negro 
boys and girls were straight-forward in, their criticism of segreg®- 
tion. This tends to invalidate the hypothesis held by some 
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the southern Negro is content and happy with his 
Thetheme of social injustice recurred again and 
 racial-segregation setting. This finding is in accord with 

(8) who concluded that “Negro college students 

e of the traditional ‘place’ of the Negro.” 

white boys and girls appeared to have a strgnger 

| with the family. These boys and girls were rather @ 
their treatment of this value. Whereas approximately 

of the emphasis units of this value indicated that they 

‘ing family love, approximately twenty-five per cent sug- 

amily frictions did exist. ‘oo oe 

n white adolescents reflected their envjronment as 
threatening. They did not, however, reflect it as threat- 

id the Negroes. This obtained difference, which was dis- 
‘orable to the white youth on the happiness value, can 
not be interpreted in a simple straight-forward fashion, 

ere is some evidence that for the Negro youth the securing 
justice and group unity is synonymous with having hap- 
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Joun W. M. Warre axb Irvin L. Cmim. Child Training and 
Personality. New Haven, Conn.: Yale University Press, 1953, 
pp. 353. $5.00. : 


à 


4 b 

This extremely interesting and valuable work deals, as the title 
suggests, with some empirically determined relationships in a large 
number of primitive cultures among certain child-rearing practices 
and several adult behavior patterns conventionally subsumed 


wunder the term ‘personality’. While cross-cultural studies have been 


used for many years to help understand and explain personality 
development, these investigations have rarely used psychological 
variables, such as habit or generalization, in their hypotheses, Few, 
if any of these previous investigations have attempted to sub- 
ject their hypotheses to statistical tests with quantified data. 
Whiting and Child, in their use of conventional psychological 
variables and in their attempts at quantification, have clearly 
made two significant contributions to the usefulness of the cross- 
cultural approach for psychology. 

The authors, a cultural anthropologist and an experimental 
Psychologist, were interested in “the extent and ways in which 
Personality processes in individual members of a society determine 
the integration of the culture.” Their general hypotheses were 
couched, for the most part, in terms of reénforcement learning 
theory, using Hull’s theoretical model. Neal Miller’s seminal work 
on conflict and displacement plays an important rôle in several of 
the authors’ theoretical analyses, 

More specifically, the authors studied the child-rearing practices 
in seventy-five primitive societies on which relevant data were 
available in the ethnographic reports of the Human Relations Ares 
Files. To quantify this child-rearing data, each culture was rated 
On the initial satisfaction and the socialization anxiety in five 
Separate areas (orssystems) of parent-child interaction—oral (i.e 
weaning), anal (i.e., toilet training), sexual, dependency, and 
Aggression. The ratings of initial satisfaction were based upon the 
length of initial gratification, the amount of freedom given to the 
child to perform his initial habits, and the amount of concurrent 
anxiety. The ratings of socialization anxiety or the concurrent 
anxiety involved in imposing inhibitory controls were based upon 
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wity of initial satisfaction, the severity and frequency of 
nent, and by overt signs of emotional disturbance in the 
mn. These quantitative ratings of initial satisfaction and 
ization anxiety were then compared with the particular ex- 
s of illness and the consequent therapeutic practices in 
cultures. * n 

wing the specific hypotheses for testing much use was 
the concepts of fixation, identification, projection, and 
t. While drawing upon the original psychoanalytic 
tions for a beginning, they then translated these notions 


nd Miller (Personality and Psychotherapy). 
articular interest to psychologists is Whiting and Child’s 
of fixation. They believe that the consequences of ex- 
frustration and extreme gratification are quite dissimilar 
at inclusion of both antecedent conditions under a general 
of fixation masks the potential usefulness of such an inter- 
on. They suggest that fixation resulting from a high degree 
tification be termed ‘positive fixation’ while that resulting 
high degree of frustration be termed ‘negative fixation’. The 
| of negative fixation is used in conceptualizing the relation- 
n socialization anxiety and explanation of illness, while 
e fixation is used in conceptualizing the relationship between 
‘Satisfaction and the therapeutic practices in illness. The 
of negative fixation would suggest, in this instance, that 
high in oral socialization anxiety should demonstrate this 
utilizing oral explanations of illness (i.e., some act of 
on the part of the patient is responsible for the illness), 
@ societies with low oral socialization anxiety should use such 
tions less frequently. The concept of positive fixation would 
iat high initial satisfaction in an area should give rise to 
lormance of responses in that area as a therapy in illness, 
a initial oral satisfaction should lead to some sort of oral 
by. These two concepts of fixation areyused in a similar 
ion to study socialization anxiety and initial satisfaction in 
other four areas of development (anal, sexual, dependency, 
ssion) and the concomitant explanations of illness and 
tic practices, The over-all empirical evidence is interpreted 
the separation between positive and negative fixation 
evidence for a concept of negative fixation. The lack 


) their ‘general behavior theory’ much in the fashion of æ 


D 
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of strong support for a concept of positive fixation is explained in 
terms of the possible inadequacy of use of “therapeutics as the 
criterion variable, the inadequacy of the data on therapeutic prac- 
tices, and the more general problem of progressive satisfactions, 
i.e.; the changing nature of fixated goals with maturity. The au- 
thors, suggest that this dichotomy between positive and negative 
fixation be further studied in future investigations of this sort. 

The text goes on to discuss the origins of guilt as a function of 
identification and the origins of the unrealistic fear of other persons 
and of spirits as a function of anxiety about aggression. Limitations 
Hf space prevent a fuller exposition of these hypotheses. 

The authors come to two rather general but related conclusions 
in their final chapter. Firstly, the early life experiences of a person 
do have a persistent effect on this personality. Secondly, the per- 
sonality characteristics of the typical members of a society do 
determine many of the beliefs and practices of that society. These 
conclusions are bolstered by the more specific findings partially 
discussed above. 

The potential reader should be forewarned about one possible 


` danger in reading this text. The highly sophisticated statistical 


procedures (e.g., normalizing and pooling several judges’ ratings 
(p. 232 ff.)) might lull the unwary reader in forgetting the coarse- 
ness of the basic data and the real limitations such coarseness sets. 
While the authors mention this consideration, it remains a problem. 
The potential usefulness of this methodology coupled with the 
present positive findings suggest that any future investigations of 
this type would profit from the use of rather complete ethnological 
data collected with the specific variables to be studied in mind. 
Tn some places the authors’ speculations and theorizing transcends 
the available data. For the most part, fortunately, these specula- 
tions are clearly labeled as such. 

These minor criticisms in no way detract from the over-all ex- 
cellence and importance of this text. It represents a significant 
contribution to behavior theory, child psychology and cultural 
anthropology. Specialists in these areas cannot afford to neglect 
this important contribution to our empirical knowledge and theo- 
retical understanding. Lzonarp D. GoopstTEIN 

State University of Towa 
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q SHAFFER AND Ricuarp S. Lazarus. Fundamental Con- 
‘in Clinicdl Psychology. New York: McGraw-Hill Book 
952, pp. 540. 


od textbooks in the field of clinical psychology are looked for 
yerness at the present time. Shaffer and Lazarus havestried 
es, methodology, and general techniques.” The jacket 
promises that the book “provides the training which is 
ble for understanding and competance in the specific 


more modestly that the book may “provide the foundation 
y for growth in proficiency and for more complete under- 
ng of the administration and interpretation of the technical 
ments of diagnosis and the methods of conducting a thera- 
atic interview.” 

“The book very properly assumes that the reader will have as 


quisite, “some knowledge of general and abnormal psy- 
» + 


rief historical survey brings one to the present situation in 
cal psychology. The main subjects are dealt with in the follow- 
rder: methodology; clinical techniques, intelligence and its 
ement; defects of intelligence; the nature of personality; 
onality measurement; introduction to psychotherapy; psycho- 
peutic devices; psychoanalysis; distributive analysis and 
thesis; special psychotherapies; physical and chemical therapies; 
the clinician in action. 
“One is perhaps impressed most with the excellent balance and 
dgment found throughout the book. It is generally possible 
the critic to point out over-emphasis on some subjects, and 
i lure to properly include and discuss others; and although this- 
5 possible for the present volume, the more accurate criticism is 
that there has been excellent selection of material and fine judg- 
ht in its treatment. z 
` ‘the reader is interested in the practical applications of psy- 
E chology in one of its most important fields of application, he cannot 
to miss the critical summaries presented in this book; if he 
_ 18 interested in fundamental facts and concepts of clinical psy- 
_ thology, he may be surprised and pleased at the amount of general 
= 8nd historical information condensed in its pages. The condensed 
4 


o 


& 


great deal of success to provide an introduction to “basic © 


techniques.” The authors, with whom this reviewer agree% @ 
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form of writing does not mar the readability of the book but 
rather adds to its value. a 

It is interesting to note that although neither of the authors 
accepts the old ‘faculty’ psychology, (see p. 98), yet the old verbal 
habit can lead them to say that, “The reader is urged persistently 
to develop the critical faculty ...” (p. v1). Which suggests that 
while the old determining tendencies (shall we say habits?) did not 


- fail to show up in the writing, they might well have appeared more 


casually in the discussions. Shaffer or his co-author in another text 
wrote that, “The importance of habit formation in abnormal 
psychology is so great that it is not apt to be over-emphasized.” 

This most useful text not only presents to the reader well 
selected discussions of most of the fundamental matters which 
should be studied by the beginning student in clinical psychology, 
but the bibliography (pp. 495-525) puts him in touch with the 
remainder. It would have been of added value to the student if 
some of these could have been starred to indicate the most valuable 
collateral readings for the student. Pointing these out is perhaps 
the more appropriate task of the instructor. 

‘Two well-made indexes complete the book. It will perhaps be the 
leading textbook for some years to come and is at the same time 
a valuable book of reference, A. S. EDWARDS 


The University of Georgia 


Oscar KRISEN Buros, Ep. The Fourth Mental Measurements 
Yearbook. Highland Park, N. J.: The Gryphon Press, 1953, 
Pp. 1163, $18.00. 


Every testing specialist has learned to rely on the Buros Year- 
books as desk companions. Every teacher or counselor using pub- 
lished tests should study MMY as a whole, and consult it fre- 
quently. MMY is a bibliographer’s dream: references to all tests, 
to all recent studies on each, and to books on testing. The organiza- 
tion and printing are magnificent. 

The heart of the book is its critical reviews of tests. In the new 
Yearbook, these are invariably thoughtful. Except in the area of 
classroom achievement tests, reviewers have frequently tried out 
the tests and many reviewers report minor research performed to 
® 
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data for the review. Criteria for judgment are much 
jn former yearbooks. The reviews give less weight 
y to picayune errors, more to the function of a test in 
and clinical interaction. A long honor roll could be 
superior reviews. The following have exceptional gen- 
spectacular review by Shimberg of Health pwen- 
; Anastasi on ‘primary abilities’ tests; Westby on Pro- © 
Matrices; Courtis on the Thorndike Handwriting Scale 
le history); Fowler on Henmon-Nelson; and Kaulfers on 


yearbook draws attention to new tests which’might other, © 
‘be overlooked. Judging from the reviews, the person interested 
ts should become acquainted with these: 1947 Progressive 
California Cumulative Guidance Record (Elem.), Diferen- 
bilities Test, SRA Self Scorer, IPAT Culture-Free Intelligence, 
Aptitude, Diagnostic Reading Tests, Wing Musical Intelli- 
Children’s Apperception Test, Functional Evaluation in 
atics. While important, the General Aptitude Tests of USES 
the Weschler Scale for Children were disappointing to reviewers. 
are a good many negative reviews. 
ily three major publishers—World Book, Psychological Cor- 
tion, and ETS—are generally producing tests which the re- 
regard as adequately developed, and even these houses fail 
to maintain high standards consistently. Some other large pub- 
k are accused of serious violations of judgment and even of 
ie s. Over and over, reviewers say that tests have been released 
í turely, that they are acceptable for research use but because 
ficient validation or standardization are not suitable for 
ical use. This points up a major policy issue. Should we en- 
ürage the release of a test incorporating new ideas, even though 
s fundamental test development is incomplete? We need innova- 


ority of newer tests are at present suitable for use only 
se who will conduct research on the tests in their own setting, 
usually distributed without such limitation. A favor- 
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able sign is the report that many manuals have been revised to 
meet criticism from the last MMY. Older tests, however, have 
generally continued in print with neither revision nor continued 
research. Their authors would do well to withdraw some of them, 

Buros has now perfected his yearbook technique. It is regrettably 
necessary to question the program and basic conception he holds, 
even though the series is marvellous if it can be sustained. The 
price of the volume is $18, and this does not cover all costs, MMY 
is priced out of its market. If it cannot sell in volume, some new 
plan is needed. Changes can be made which will reduce the year- 
þook’s value far less than they will reduce costs. An editor might 
consider these alterations. (1) Eliminate four hundred and fifty 
pages of reprinted book review excerpts, a costly feature not much 
used. (2) Eliminate reviews of foreign tests unlikely to be suitable 
for the American market. This is a provincial suggestion, but it 
would be better at this point to disseminate MMY more widely in 
this country than to pursue comprehensiveness. (3) Remove the 
exhaustive and expensive bibliographies on tests. (4) Eliminate 
analytic reviews of obsolete tests. A balanced review of (for ex- 
ample) the Zyve test is not needed. One sentence stating that this 
test should not be used for such and such purposes, signed if neces- 
sary by a group of adequate authorities, tells the student all he 
needs to know. Buros has this year made an effort to resurvey old 
tests, to such lengths as seven pages on variants of Army Alpha. 
Some retrospective reviews are of current value, but most could 
be reduced to much more blunt statements that a test is obsolete, 
or that, while originally ‘experimental’, in twenty years it has not 
been improved to the point of usefulness. (5) Possibly more radical 
suggestions would be worth considering. Suppose the test-by-test 
reviews in the vocational interest field, for example, were replaced 
by a single summary of the field, signed by Super, Bordin, Carter 
and Bennett. This could provide perspective, balance, and guid- 
ance far beyond that allowed by the present organization, however 
perfect MMY is by librarianship standards. 

Few individuals have made so singular a contribution to the 
profession as Buros. Devotion, self-sacrifice, clarity of purpose, 
and editorial talent are evidenced in every page of Buros’ series. 
It is gratifying that the leading measurement organizations have 
recently adopted a testimonial resolution recognizing this. Criti- 
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; ofthis elegance, and that Buros can compromise 
ideals without seriously compromising his pro- 
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CHILDREN OF COMPARABLE-MENTAL | 
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In a recent study by Kolstoe (2), the usefulness of mental age 
“a8 a unit of measurement was investigated by comparing the 
mental abilities of bright and dull children of similar mental ages. 
In general, his results revealed only minor differences in the traits 
measured. The present investigation is a companion study which 
involved essentially the same samples of bright and dull children 
as those employed by Kolstoe. Its purpose was to determine the 
extent to which children of equal mental age but markedly differ- 
ent in chronological age and IQ tend to be alike with respect to 
achievement in reading. This was done by comparing bright and 
dull children with approximately equal estimated true mental 
ages with respect to each of several abilities involved in reading 
comprehension. The criterion for ‘dull’ was an estimated true 
Stanford-Binet IQ of 84 or below; the one for ‘bright’ was an 
estimated true Stanford-Binet IQ of 116 or above. Thus the 
minimum separation between the two groups was approximately 
two standard deviation units along the IQ continuum. Estimates 
of the true 1Q’s and MA’s were obtained by applying the standard. 
formula for correction for regression to the obtained 1Q’s and 
MA’s, with the reliability coefficients used in the formula being 
those reported by Terman and Merrill for various IQ levels. 

Results of the extensive investigations of Lewis (3) and of 
= 

1 This article is based on the writer’s Ph.D. dissertation. The work was 
done at the State University of Iowa under the co-direction of Professors 


i Si « Hieronymus and J. B. Stroud. 
; >L. M. Terman and M. A. Merrill, Measuring Intelligence, Boston: ie 


Houghton Mifflin Co., 1937, e 
hay , p. 46. 
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intelligence test to ‘over-achieve’ or a pupil near the first percentile 
to ‘under-achieve’. 

In a study by Ramaseshan (6), Mith-erade pupils of similar 
mental ages (Chicago Tests of Primary Mental Abilities) were 
grouped according to chronological age. Children with similar 
mental ages but different chronological ages (and, consequently, 
differing 1's) were compared on the subtests of the Jowa Tests of 
Educational Development. Her bright group was found to exe 
on all of the subtests of the ITED, but differences were statistically 
significant for only three of these subtests. Of some pertinenc® 
to the purposes of the writer’s study is the fact that ‘no significant 
differences were found on the three subtests measuring the ability 
to read and interpret material in specific content areas. 
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ot to reading comprehension scores earned on the Towa 
¢ Skills, Unsicker found significant differences favor- 
t group. Differences between the groups matched on 
California MA’s were not significant. 
d n to the fact that diferent criterion tests were em- 
id, the present study differs from most of those in which the 
a has been investigated previously in that estimated true 
used in the matching procedure, individually admin- 
rence tests rather than group tests were employed, 


el 


th ceiling and floor effects did not operate to bias the means 


PROCEDURE 


of Sampling—In order to measure a wide range of 
hension abilities, it was decided to employ test materials 
ire appropriate at the fourth- to fifth-grade level of reading 
Considering the 1Q criteria for ‘bright’ and ‘dull’ (lower 
per 1Q limits of 116 and 84, respectively), and studying the 
p of MA’s at various CA’s for these bright and dull criteria, 
cided to use a mental age range of from ten years, seven 
ath through twelve years, six months, and to restrict the 
Ww group to children with CA’s of ten years or less and the dull 
up to children with CA of fourteen years or above, 
ildren in the dull group were selected from regular eighth- 
ninth-grade classes and some special education classes in two 
ilor high schools, and children in the bright group from regular 
nd- and fourth-grade classes in three elementary schools, in the 
blic school system of a large Iowa city. For identification of 
ikely to meet sample specifications, the cumulative record 
of each child in the two junior high schools, and in third 
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TABLE I.—CHARACTERISTICS OF THE SAMPLES 


Grolips 
Bright Dull 
Range of CA’s 8-7 through 9-10 14-2 through 16-3 
o Mean TA 92.5 15-5.4 
Range of MA’s (est. true) 10-8 through 12-6 | 10-8 through 12-6 
Mean MA 11-3.2 11-3.0 
Range of IQ’s (est. true) 116 through 138 72 through 84 
Mean IQ 126.5 79.5 
O 


Stanford-Binet Scale, Form L, was administered until there were 
obtained twenty-nine pupils who met the following specifications: 

1) Estimated true IQ’s of 84 or below. 

2) Estimated true MA’s of from ten years, seven months, 
through twelve years, six months. 

From the list of third- and fourth-grade pupils indicated as 
likely to meet sample specifications,for the bright group, pupils 
were selected randomly and the Revised Stanford-Binet Scale, 
Form L, was administered until there was obtained a sample of 
pupils meeting the following specifications: 

1) Estimated true IQ’s of 116 or above. 

2) Estimated true MA’s of from ten years, seven months, 
through twelve years, six months. 

8) As many bright children in each of four six-months mental 
age intervals or levels (which constituted the two-year range 
indicated in the preceding specification) as there were dull children 
in that interval. 

In the process of obtaining enough bright cases in each level to 
match the number of dull cases in that level, extra cases were 
obtained for some of the levels. For purposes of statistical analy- 
sis, extra cases in each level were later discarded randomly. The 
final sample contained twenty-eight children in each group, dull 
and bright, one case originally selected for the dull group having 
been lost because of incomplete data. In the dull group, there 
were fifteen boys and thirteen girls; thirteen in Grade 8, six ™ 
Grade 9, and nine in special classes. The bright group was Com: 
posed of sixteen boys and twelve girls; fifteen in Grade 3 and 
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‘thirteen in Grade 4. A summary of information relative to the 
‘dull and bright groups in the sample is presented in Table I. 

Selection of Comprehension Abilities Investigated.—A. survey of 
the professional literature, teachers’ manuals accompanying series 
of readers, standardized tests, and reported results of factorial 
studies was made for suggestions of specific abilities invelved in 
reading comprehension. Consideration was also given to availa-® 
bility of measuring instruments for these specific abilities, the 
possibilities of adapting available instruments to the purposes of 
this study, and the purported importance of given abilities. This 
resulted in the selection of the following abilities for inclusion in @ è 
this investigation: 

1) Word Recognition (the ability to recognize given words 
‘on sight’). 

2) Word Meaning (the ability to understand or recognize the 
particular meaning of a word as it is used in context). 

a 3) Memory for Factual Details (the ability to recall. specific 
facts which have been definitely stated in a selection). 

4) Location or Recognition of Factual Details (the ability to 
locate or recognize specific factual details which are explicitly 
stated in a given selection). 

5) Perception of Relationships among Definitely Stated Ideas 
(the ability to recognize or to formulate an idea which is not 
explicitly stated in a selection but which is contained in the selec- 
tion when two or more definitely stated ideas are considered to- 
gether). 

6) Recognition of Main Ideas (the ability to recognize the 
central thought or main idea of an entire selection, a paragraph, 
or a specific part of a paragraph). 

7) Drawing Inferences and Conclusions (the ability to recognize, 
or to formulate, an idea which is not stated in a selection but which 
is dependent upon the combination of an idea (or ideas) which is 
(are) definitely stated in a selection and one which is outside the 
selection and within the informational or experiential background 
of the individual). 
hh addition to these abilities, measures of reading rate and 
listening comprehension were also obtained. Reading rate scores 
Were secured not only because comparisons between the groups 
Were of some interest in themselves, but also because marked ‘ 
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differences in rate could be responsible for differences in compre- 
hension abilities, even though the tests were untimed. The 
measures of listening comprehension were obtained in order to 
determine whether a in general comprehension ability 
exist when unencumbered by possible difficulties with mechanical 
skills ingreading. 
® Criterion Tests Used—Highty words from the Flashed Word 
Recognition and Word Analysis Test of the Durrell Analysis of 
Reading Difficulty, and twenty more difficult words from various 
forms (Q through T) of Part II (Vocabulary) of the Reading 


< Comprehension Test, Advanced Battery, of the Iowa Tests of 


Basic Skills, were included in the Word Recognition Test. These 
one hundred words were arranged in lists of twenty each and were 
presented tachistoscopically for a duration of approximately one 
second. Only results obtained with the last four lists (eighty 
words) were included in the analysis of results. 

The Word Meaning Test was a multiple response test consisting 
of a representative sample of fifty items chosen from various 
forms (Q through T) of Part II (Vocabulary) of the Reading 
Comprehension Test, Elementary and Advanced Batteries, of the 
Towa Tests of Basic Skills. 

A test of Comprehension Abilities was made up of nine reading 
selections and one hundred and thirty items which were adapted 
from the Reading Comprehension Test of the Elementary and 
Advanced Batteries of the Iowa Tests of Basic Skills, Forms L 
through T. Five subtest scores, each based on from twenty-five 
to twenty-seven items, were obtained as measures of these five 
Specific abilities: Memory for Factual Details, Location or Recog- 
nition of Factual Details, Perception of Relationships among 
Definitely Stated Ideas, Recognition of Main Ideas, and Drawing 
Inferences and Conclusions. The reading selections were chosen 
on the basis of relative difficulty, apparent interest value, and the 
extent to which items accompanying selections represented the 
Specific abilities named above. To obtain subtests of approxi- 
mately equal length and difficulty, a number of original items wer? 
eliminated and additional ones were constructed when necessary: 

A Listening Comprehension Test was constructed in a manner 
similar to that for the Test of Comprehension Abilities, except 
that items for Location or Recognition of Factual Details were not 

Includes. The test consisted of four reading selections and forty- 
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items. The test booklets contained onlyfthe questions 
ed to the selections. The selections and the test items were 
read aloud to the subjects by the examiner, and subjects marked 
their chosen responses to each question after it and its answer 
choices had been read. 
The Reading Rate Test consisted of a selection of approfimately 
en hundred words of fourth- to fiftlngrade level of difficulty. 
Subjects were instructed to read the selection once “at the same 
speed as you usually read.” They were told before they began 
that there would be questions over the material read. A short 
comprehension test, consisting mainly of items measuring mem®ry @ » 
"for details, followed the reading of the selection. The rate score 
i was a complement of the number of complete ten-second intervals 
_ which had elapsed during the reading of the selection. 
All of the criterion tests were administered by the writer during 
an eight-day testing period. All were administered as group 
__ tests, without time limits, with the exception of the Word Recog- 
nition Test, which was administered to each subject individually. 
All test items, except those for the Word Recognition Test, were 
"of the four-choice multiple-response type. Except for the Reading 
Rate Test rate scores, all scores used in the analyses of results 
j ‘Tepresented the number of items answered correctly. 
_ Analysis of Results —An analysis of variance design, ‘group-by- 
levels’, was employed in the analysis of results.’ The bright and 
dull children represent the ‘groups,’ and the intervals of six 
Months in the two-year mental age range used in the study con- 
stitute the ‘levels? A schematic presentation of the groups-by- 
levels design as it applies to this study is shown below. 


id Criterion Scores 
MA Level MA (Ed True) eee 
„| Bright Group Dull Group 
ee I 12-1 to 12-6 n=7 n=7 
; peg 11-7 to 12-0 n=9 n=9 
K HI 11-1 to 11-6 n=7 n=7 
i IV 10-7 to 11-0 n=5 n=65 


_ Tn the case of each ability investigated, the null hypothesis was 

>= : 4 

yy E. F. Lindquist, Design and Analysis of Experiments in Psychology and m 
; Education, Boston: Houghton Mifflin Co., 1952. e 
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Taste II.—Summary or Resuuts: Mean Scores ror Eacu Grovp, 
DUFFERENCES BETWEEN MEANS, AND F-Vayps, ror Eacu 


ABILITY 
Bright Dull = 
Abit -rar Ft 
š Mean sD Mean sD 
Word Recognition 8.6 | 11.8 | 56.2 | 13.0 | +2.4 | 0.499 
Word Meaning 30.4 | 6.1 |30.8| 6.1 | —0.4 | 0.076 
4.0 | 15.1 4.8 +2.3 3.678 


Memory for Factual De- | 17.4 
tails 


g estion or Recognition | 15.8 | 3.6 | 12.7 | 3.3 +3.1 | 10.126** 
of Factual Details 
Perception of Relation- | 15.4 | 3.6 | 14.0 | 3.6 +1.4 | 2.058 
ships among Definitely 
Stated Ideas 


j 


Recognitionof Main Ideas | 16.3 | 4.1 | 13.9 | 3.9 +2.4 | 5.319** 

Drawing Inferences and | 14.9 | 3.1 | 12.9 3.7 +2.0 | 4.628** 
Conclusions i 

Total Comprehension | 79.8 | 15.1 .4 | 16.7 | +11.4 | 7.922** 


68 
Abilities | 
Listening Comprehension | 30.6} 3.6 | 23.6 | 4%7 | +7.0 | 47.574** 
Reading Rate (rate score) | 61.6 | 17.9 | 55.4|%5.9| +6.2 | 1.602 
Reading Rate (compre- | 10.7 | 2.4| 9.8| 3.4| +0.9 | 1.444 
hension score) 


* For each F-value, df = 1, 48; Fos 44.04, X 
** Significant at 5 per cent level. 


tested, d.e., that the means of the populations pf which the dull 
and bright groups were representative samples were the same. To 
test this hypothesis, the ratio of the mean square for groups to 
the mean square for within cells was employed. This ratio yields 
a value which is distributed as F, prgyided that the hypothesis 18 
true and that certain conditions are met. A five per cent coeffi- 
cient of risk, selected in,advanee of the analyses, was employed in 
rejecting the null hypothesis. ~* 


RESULTS 
A summary of the obtained results is presented in Table T 
The table includes the mean and standard deviations for ea0 
group, the differences between the means, and the F-values ob- 
tained in the tests of the significance of the differences between 
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the means of the bright and dull groups. Positive differences favor 
the bright group. » 

Obtained differences between the mean scores of the two groups 
favored the bright group with respect to all of the abilities except 
Word Meaning. In the case of this one exception, the difference 
was not only nonsignificant, but also less than one raw sqpre unit. 
The analyses of variance yielded significant differences with® 
respect to the following five abilities: Location or Recognition of 
Factual Details, Recognition of Main Ideas, Drawing Inferences 
and Conclusions, Total Comprehension Abilities, and Listening 
Comprehension. While not significant at the required lgvel, œ 
differences between the mean scores for Memory for Factual 
Details and for Perception of Relationships among Definitely 
Stated Ideas were substantial, and further investigation with 
respect to these two abilities seems warranted. Differences with 
respect to Word Recognition, Word Meaning, and Reading Rate 
were not significant. 

The Test of Listening Corhprehension was included in anticipa- 
tion of the possible egent that the group which was found signifi- 

“cantly poorer on most of the abilities would be found to be signifi- 
cantly better with respect to listening comprehension. Then such 
results might have been interpretéd in terms of possible difficulty 
with mechanical'skills in reading rather than in terms of differences 
in intellectual abilities or specific tomprehension abilities. How- 
ever, the bright group, which excelled on nearly all of the abilities, 
‘also excelled significantly oh the Listening Cémprehension Test. 
Similarly, if a significantly higher rate had been found for the 
group also found to be significantly poorer with respect to most of 
the abilities, the poorer showing might have been attributed, in 
part, to tendencies to readearelessly and too hastily. However, 
No significant rate difference was found between the two groups 
and analysis of the reading rate comprehension check indicated 
» that the two groups had read the reading rate selection with 
comparable degrees of understanding. These findings with respect 
to reading rate and listening comprehension tend to further in- 
dicate that superjority in reading comprehension involves supe- 
tiority in intellectual functions rather than in the more mechani- 
cal skills. 
Tests for interaction between groups and levels were also made. 
8 
Q 
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No significant interaction effects were found for any of the abilities 
tested, thus satisfying one of the necessary conditions or under- 
lying assumptions involved in the particular design of the study. 
Inspection of the frequency distributions for the various abilities 
involved in this study revealed that, in general, obtained scores 
did not @losely approach the maximum possible at the upper end 
“Of the distribution or the ‘chance’ scores at the lower end. Thus, 
neither ceiling nor floor effects operated to bias the results. 


CONCLUSIONS 


<2 In the strictest sense, the sample studied may be regarded as a 
Tepresentative sample only of hypothetical populations that show 
the same relative distribution of MA’s, 1Q’s, and CA’s as the groups 
in the sample itself; and generalizations based upon obtained results 
should be restricted to these hypothetical populations, However, 
since no significant interaction effects between groups and levels 
were found, restrictions upon extending generalizations to real 
populations may be lifted to a considerable extent. Therefore, 

izing to a population of dull and bright children with 
widely differing 1Q’s but approximately equal MA’s within the 
MA ranges found in this study and with reference to the various 
comprehension abilities as defined operationally, the following 
conclusions seem warranted: aii Bi 

1) Bright children are significantly superior to dull children of 
comparable mental ages with respect to total reading comprehen- 
sion and the following specific abilities: locating or recognizing 
factual details, recognizing main ideas, and drawing inferences and 

2) Tt seems probable that bright children are also superior to 
dull children of comparable mental agés with respect to memory 
for factual details and perception of relationships among definitely 
stated ideas, acne w 

3) Bright children are superior to dull children of comparable 
mental ages with respect to listening comprehension. 

4) Reading rates of bright and dull children of comparable 
mental ages appear to be approximately the same when comparable 
degrees of understanding of material read are attained, with 2 
wide range in rate being found in both groups. _ 

| 5) Bright and dull children tend to be alike with respect to 
any i} word recognition and word meaning. Bright children 
Qo. 
2 ° 
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fi icantly superior to dull children of comparable mental 
respect to the relatively more complex, and intellectual, 
e n abilities. 

t would seem that levels of expectation with respect to the 
fe complex comprehension abilities should not be as high for 
ch as for bright children of comparable mentak ages. 
; © 
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between prejudice and certain patterns of emotional development 
and expression, it would appear logical that one approach to 
modifying prejudice might be through techniques which focus 
primarily on many of the same emotional factors. Thus, within 
the setting of the college classroom an experiment was designed 
which would compare the relative effectiveness of two methods of 
instruction in modifying attitudes associated with racial, religious 
and ethnic prejudice: one method utilized such group therapy 
procedures as nondirective methods of instruction and socio- 
drama, while the other employed the traditional lecture-discussion 
method. A secondary purpose of this research was to investigate 
the relationship between personality adjustment and the modifica- 
tion of the social attitudes with which this study is concerned. 
Although not directly within the scope of this investigation, the 
subsidiary problem of course content acquisition for the two 
classes taught by the investigator was examined. : 
Recent research on the problem of minority-group prejudice 
has indicated the close relationship between certain socially- 
determined personality variables and prejudice. It has also been 
demonstrated that client-centered and sociodramatic therapeutic 
procedures can be adapted to classroom situations. (2, 3, 4) Con- 
Sequently, it would appear logical that one approach in modifying 
prejudice might be through utilizing group techniques in a class- 
room situation which focus primarily on many of the same pèr- 
sonality variables. 
© This article is a summary of a Doctoral dissertation completed at New 
York University. It was sponsored by Professor Charles E. Skinner, tO 


whom the investigator wishes to express his deep appreciation for the in- 
terest and assistance provided. 
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The experimental design was organized in a manner which 
would permit integration of the research techniques into the 
fabric of the courses. Three instruments were employed; namely, 
the California Public Opinion Study E-F Scales, the Index of 
Adjustment and Values, and the Minnesota Multiphasic Inven- 
tory, both before and after the experimental period, to ppeasure 


. any changes. An objective-type course-content examination © 


prepared by the investigator was administered to the two experi- 
mental classes at the midpoint and ending of the semester. The 
experimental classes were conducted by the investigator, who 


randomly selected one group to be taught by the utilization, of @ 


group therapy procedures and the other to be conducted by the 
traditional lecture-discussion method. A third class, taught by 
another instructor using traditional methods of instruction, 
served as one control. Finally, another group which was composed 
of students registered in various courses within the same curricu- 
lum served as a time-factor control. All groups were compared on 
the significant variables and appeared to come from the same 
parent population. 


THE PROCEDURE 


The actual steps taken in the organization of this research are 
as follows: 

A) In September, approximately one week prior to the begin- 
ning of the school semester, the California E-F Attitude Scales, 
the Minnesota Multiphasic Personality Inventory, and the Otis 
Self-Administering Tests of Mental Ability were administered to 
a group of students (Control Group I) entering the Vocational 
Diploma Program at Brooklyn College. 

B) During the second class session of the two sections of Practi- 
eal Psychology D1 (the group therapy class and the traditional 
lecture-discussion class), given by the investigator, and of a third 
section of the same course given by another instructor (Control 
Group II), the E-F Attitude Scales and the Minnesota Multiphasic 
Personality Inventory were administered. The Otis Self-Adminis- 
tering Tests of Mental Ability was part of the entrance testing 
battery of all Vocational Diploma Students desiring admission to 
the College. 

C) The investigator conducted one section by the traditional 
lecture-discussion method and the other using groupgtherapy 
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procedures. The codperating instructor conducted his section by 
the traditional lecture-discussion method. The. fourth group was 
distributed among many classes, many of the students not regis- 
tered for Practical Psychology D1. 
D) During the seventh class meeting the Bills’ Index of Adjust- 
ment apd Values was administered to the two classes conducted 
“by the investigator. 
» E) During the last session of the semester the four groups were 
retested with the original test instruments, including the Index of 
Adjustment and Values, with the exception of the Otis Self-Ad- 


_ < migistering Test of Mental Ability. 


MP 


Before- and after-testing of attitudes and personality was 
conducted in order to measure any changes which might have 
taken place as a result of the teaching methods used. The purpose 
of the control groups was to estimate the effect of factors other 
than those of the teaching methods used; i.e., bias of instructor 
and the many uncontrollable variables attendant on just being 
registered in college for a similar period of time. Thus, there were 
two kinds of comparisons possible to evaluate the effects of the 
educational methods used, First, an intra-group comparison 
consisting of noting differences in the same group at the beginning 
and end of the semester; and, second, the inter-group comparison 
of noting the differences among the four groups studied. 

Since any conclusions based upon this investigation are depend- 
ent in good part on the experimental variables introduced, it 
would seem necessary to offer some description of the differing 
methods used in treating the subject matter of the course. 


TRADITIONAL LECTURE DISCUSSION METHOD 


The procedure and plan of instruction followed with the lecture- 
discussion section is familiar to most college graduate and under- 
graduate classes. The instructor had rather carefully-planned 
lecture material which he presented to the class in a more or less 
systematically directed fashion towards the end of the covering a 
a given amount of pertinent factual material. Student participa- 
tion was usually limited to isolated questions and even more 
isolated comments regarding the factual material presented. 
great deal of note-taking was commonplace, with the usual cram 
Sessions for mid- and end-term examinations. The following at 
Tepresentative class materials presented: 
=) 
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E? 
A: 


` Third Meeting—Heredity-environment including genetic ap- 

proach, cultural-anthropological approach, history of controversy, 
_ studies, present thinking. 

Twelfth Meeting. —Husband-wife relationships including author- 

ity and responsibility, the career woman, economic factors, in- 


Jaws, physiological factors and divorce. ë 
o 


GROUP THERAPY METHOD 


The type of play drama used with this class was a modification 
of the psychodramatic and sociodramatic sequences developed and 
‘used by Moreno (5) and Jennings (4). During the first class mget- qe 
ing the students were asked to list five everyday situations either * 
at home or school that caused them embarrassment, created 
tension, or evoked displeasure, anger or unhappiness. The students 
were then asked to indicate whether they would care to appear in 
alittle play built around this situation which would be presented 
to the group. 

__ Many of the dramas selected for class use were drawn from those 
presented by the students. The criterion for selection, of necessity, 
was their applicability to the material of the course. Where there 
were none which would adequately touch on course content, the 
investigator created one appropriate to the topic of discussion. 

Fach play-drama was structured by the instructor, but the 
students were encouraged to select their own rôles and.play them 
out spontaneously, making up lines as they went along. The 
procedure usually followed was to permit the players to terminate 
their own scenes, with a half-hour being arbitrarily selected as a 
maximum time limit. Following the drama, the ‘audience’ was 
then invited to comment on any phase of the drama that to them 
invited comment. Concentrated effort was made to keep the two 
courses fairly similar in terms of subject matter treated and in the 
amount of time devoted to each area covered. Hence the subject 
of prejudice itself was confined to one class session in each class. 

Examples of specific play-dramas used in connection with the 
development of course content paralleling the examples selected 
from the traditionally conducted class are as follows: : 

Heredity-Environment.—A young couple who have been seemg 
each other for the past year decide that they are deeply in love 
and wish to marry. The fly in the ointment, however, is that the = 
boy’s father is serving time in the federal penitentiary fop crimi- 
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nal offense. The young man’s conduct and behavior have always 
been examplary, but the young lady is aware of the strong objec- 
tions of her parents to this contemplated marriage. They never- 
theless decide to have the matter out with the girl’s parents that 
night. 
Husbgnd-Wife Relationships—a) The younger brother never 
@ seems to be able to get dressed early enough to get to school on 
time. Father, in a fit of anger at this recurrent situation begins to 
argue with mother. b) Mother, unable to cope with the children 
asks father to do something about it. Father answers by declaring . 


a it’s not his responsibility. 


ip 


RESULTS 


Table I presents the mean attitude scale scores, variability and 
t ratios of the therapy groups at the beginning and end of the 
semester. There were three attitude scale variables on which the 
significance of changes were tested. These were the E, F, and E-F 
scales. The E-F score was the combined prejudice score as meas- 
ured by the entire attitude scale. 

Inspection of Table I reveals that differences obtained from 
pre- to post-testing were significant for the group therapy class. 
More specifically, changes occurring on the E scale might be 


TABLE JMuansy STANDARD DEVIATIONS AND COMPARISONS OF PRE- 
AND POST-TESTS ON THE E,F anp E-F SCALES ror 
THE THERAPY GROUP 


Pre-test Post-test Level of 
Scale N MM F «| Significe 
M 5 M + ance 
E 27 | 65.4 | 13.04] 60.9 | 15.1 | 4.22 | 1.33*| 2.06 | .05 
F 27 | 122.1 | 23.36 | 105.4 | 22.7 | 15.92 | 1.06*| 5.02 | -01 
E-F | 27 | 187.1 | 31.57 | 167.1 | 30.0 | 20.46 | 1.10*| 6.46 | .01 


Nors: Using the small sample techniques and entering the “t” table 
with 26, 27, and 28 (as the case may be) degrees of freedom the ratio 18 
significant at the .05 level if it is over 2,00 and at the .01 level if it is over 
2.65 (H. Garrett, Statistics in Psychology and Education”, New York: 
Longman, Green and Co., 1947, p. 190). d 

‘The difference between each pre- and post-test score was calculate 
for each person on each item by subtracting the latter from the former: 
Means and standard deviations of these differences were then compared. 

*F igpot significant at the .05 level of confidence. 
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‘expected, due to chance, in only five out of one hundred cases: 

changes in the F schle in only one out of a hundred cases, and for 
the combined scale significance is also well established in that:such 

" changes as occurred would probably occur by chance in only one 
out of one hundred cases. Inspection of F results reveals no signifi- 
cant difference in variability. € 

Such changes as did occur on the E, F and combined E-F scales 
for the lecture-discussion, Control Groups I and IT were not statis- 
tically significant. With the exception of Control Group II, F and 
combined E-F scales, the’ direction of change was towards a re- 
duction of prejudice. Changes in variability were likewise mot @ ig 
significant. 
Table II presents means, variability and t ratios on the Index 

for the experimental group at the beginning and end of the semes- 
ter. In the therapy group, Scale I, which is an expression of the 
individual self-concept, increased significantly in that the changes 

~ that occurred could be attributed to chance factors but five times 

? out of one hundred. Scale II, which is an expression of the accept- 

"ing or rejecting nature of the feelings the individual holds towards 

Í self, increased even more significantly in that the differences were 
significant at the .01 level of confidence. Scale III, representing 
the expression of the ideal self, was raised but not to any degree 
of statistical significance. Scale III-I which Bills has termed the 

_ discrepancy index and which relates to the discrepancy between 

the individual’s ideal self and his self-concept has been described 


Tanin Il.—Muans, STANDARD DEVIATIONS AND COMPARISONS OF PRE- 
AND POST-TESTS ON THE INDEX OF SELF ADJUSTMENT 
FOR THE THERAPY GROUP 


Pre-test Post- Level of 
Scale | N j cette aad or e Signe 
M e M o 
I 27 | 178.67] 14.38 | 184.61] 13.91] 6.09 | 1.06% 2.11 05 


i 27 |189.17| 21.84 | 202.00| 18.90| 13.33 | 1.33*| 3.76 .01 
Tr 27 | 194.44] 16.35 | 196.46] 13.71] 3.72 | 1.42*) 1.48 NS, 
ULI | 27 15.72| 15.10 | 13.11] 10.27|-1.18 | 2.16*| -51 NS. 


*The difference between each pre- and post-test score was calculated 


t 
for each person on each item by subtracting the latter from the former. 


ous F and standard deviations of these differences were then compared. ð 
is not significant at the .05 level of confidence. @ 
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as an index of maladjustment. Positive movement in psycho- 
therapy should be accompanied by reduction*in the discrepancy 
index. Results for the group therapy class indicate this decrease 
in scale II-I but not to any degree of statistical significance, 

The data obtained for the lecture-discussion group reveal that 
such changes as were effected did not attain or approach statistical 

® significance. Although the means of Scale I did increase some- 
what, means for Scales II and III decreased, The Discrepancy 
Index moved in the direction of better adjustment but did not 
approach significance. 

As in the lecture-discussion group the data for Control Group I 
indicate that changes effected were not statistically significant. 
Movement on Scales I, II and III were all in the direction of 
decreased means representing a decline in feelings of self-worth, 
self-acceptance, and of ideal self. Movement in the Discrepancy 
Index was in the direction of greater adjustment, but did not 
approach statistical significance. 

The data obtained for Control Group II reveal that with the 
exception of Scale IIT changes effected did not approach statistical 
significance. Movement on Scales I and II was in the direction of 
a decrease in self-worth and self-acceptance. Significant, however, 
at the .01 level of confidence is the movement of this group in the 
direction of a lowered ideal self, The discrepancy index was in the 
direction òf greater adjustment but did not approach statistical 
significance. à 

Table III presents the intra-group changes from the beginning 
to the end of the semester in the experimental group along with 
the “t” ratios testing the significance of these changes. Only two 
of the twelve variables in the group therapy class reached signifi- 
cance at the .05 level of confidence, Pd (Psychopathic deviate) 
and M (Masculinity-Femininity) at the .01 and .05 levels, respec- 
tively, although the Pa (Paranoia) and the K (Correction factor) 
did approach significance. Table III also shows that the direction 
of movement on the L (Lie), Hs (Hypochondriasis), D- (Depres- 
sion), M (Masculinity-Femininity), Pt (Psychasthenia) and S¢ 
(Schizophrenia) was toward decreased scores. Movement: on S 
K (Correction factor), F (Validity scale), Hy (Hysteria), P 
(Psychopathic deviate), Pa (Paranoia) and Ma (Hypomania) 
was toward increased scores, ; 

~A the lecture-discussion class four of the twelve variables 
>, 
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cy 
Tapun Tll.—Mezans, STANDARD DEVIATIONS AND COMPARISONS OF PRE- 


AND Post-rests ON THE M.M.P.I. FOR EXPERIMENTAL 
CONTROL Groups. (A) Group THERAPY 


Pieter | Postet | Chang betegen | ae 

Scale Se? Signific- 
° ance 

i M o M o M o 

A LARDA o 
L | 54.95| 6.58 |54.5 | 5.89| —.25| 589| .21 | NS. 
K |59.02| 7.29 | 61.39| 9.13| 2.20| 6.77 | 1.65 | .20 
E | 53.22| 4.67 | 53.61 | 6.70| .35| 5.99 | .29 | NS. 
‘Hs |54.43| 8.78 | 52.87 | 6.23 | —1.13 | 8.02 | .71 | N.S. 
D |53.89]|10.05 | 52.33 | 9.50 | -1.21 | 5.30 | 1.16 | NS. Si 
“Hy |55.11| 9.15 |57.09| 6.75 | 1.83 | 6.49 |*1.43 | N.S. 
Pd |54.78| 9.22 | 60.44 | 10.45 | 5.44 | 9.93 | 2.79 | .01 
M |52.94| 6.36 |49.54| 8.19) 3.45] 7.20 | 2.41 | .05 
Pa |49.31| 6.51 |53.00| 8.83| 2.79| 8.09 | 1.75 | .10 
Pt |54.28 | 10.38 | 52.94 | 8.75 | —1.62 | 10.41 | .79 | N.S. 
Be | 55.61) 6.24 | 55.46 | 6.96 | —.32 | 6.89 | .23 | NS. 
Ma |53.44| 8.25 | 56.33 | 10.01 | 2.98 | 11.24 | .14 | NS. 


Nora: Using the small sample techniques and entering the “‘t?’ table 
with 26, 27, and 28 (as the case may be) degrees of freedom the ratio is 
en at the .05 level if it is over 2.00 and at the .01 level it it is over 

1The difference between pre- and post-test scores was calculated for 
each person on each item by subtracting the latter from the former. Means 
and standard deviations of these differences were then compared, 


Tevealed significance at the .05 level of confidence. These were in 
K (Correction factor) .01 level, Hs (Hypochondriasis) .05 level, 
Hy (Hysteria) .02 level and Sc (Schizophrenia) .05 level scales. 
The direction of movement of these was towards increased scores, 
as it was also on L (Lie), K (Correction factor), F (Validity scale), 
Pd (Psychopathic deviate) and Pa (Paranoia), though the latter 
Were not significant. Decrease in mean scores occurred in the D 
(Depression), M (Masculinity-Femininity) and Ma (Hypomania), 
but these changes were likewise not statistically significant. 

Only one of the twelve variables in Gontrol Group I reached 
Significance at the .05 level of confidence, the F (Validity scale). 
Movement towards a decrease in mean score occurred in this and 


also in the Hs (Hypochondriasis), D (Depression), Hy (Hysteria), 


M (Masculinity-Femininity), Pa (Paranoia), Pt (Psychasthenia), 
and Sc (Schizophrenia) scales. Movement towards igereased 
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scores occurred in the L (Lie), K (Correction factor), Pd (Psycho- 
pathic deviate), and Ma (Hypomania). s 
None of the variables in Control Group IT attained significance 
at the .05 level of confidence. Movement towards increased scores 
took place in the L (Lie), F (Validity scale), Hs (Hypochondriasis), 
Hy (Hysteria), M (Masculinity-Femininity), Pa (Paranoia), Pt 
® (Pychasthenia), and Se (Schizophrenia) scales. Movement towards 
a decrease in score occurred in the K (Correction), D (Depression), 
Pd (Psychopathic deviate), and Ma (Hypomania) scales. 
In interpreting these data it should be noted that both pre- and 
æ «A post-testing scale means for all groups did not reach abnormality. 
Thus, while it is possible to discuss significant scale shifts in 
relation to pre-test scores, it must be stressed that there were 
actually no significant changes in total adjustment for any of the 
groups on any of the scales of the MMPI. The significance of 
scale shifts towards increased means can therefore be considered 
of extremely tentative significance with relation to the clinical 
entity each scale is designed to measure. Decreases in means can 
generally be disregarded in view of the small range of deviation. 
The Group Therapy Class had as many shifts in mean seale 
scores in the direction of maladjustment as it had in the direction 
of greater adjustment. The Pd (Psychopathic deviate) scale, 
which purports to measure a disregard of social mores, moved 
significantly in the direction of decreased adjustment, while the 
+ M (Masculinity-Femininity) scale moved significantly in the 
opposite direction. A possible interpretation of the Pd (Psycho- 
pathic deviate) scale shift is that the group responded to the 
therapeutic climate by an increased tendency to flout social mores 
or conventions. A plausible speculation, in view of the fact that 
the group mean did not approach borderline significance on the 
psychopathic deviate scale, is that the movement may be inter- 
preted as a shift away from conventional behavior and a decrease 
in the repressive aspects of super-ego motivation. The plausibility 
of this speculation is further supported by the F (Validity scale) 
results where it was seen that the group significantly declined m 
expression of rigidly-held conventional attitudes. Supportive, to 
are the data with regard to the heightened self-concept and self- 
acceptance indices of this group. It is logical to assume that mov 
é ment in this direction may be permissive of greater felt freedom 
tdp from the expected behavior pattern. As indicated abov® 
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the decrease in F (Validity scale) means may, for all intents and 
be disregarded. 

` Scale results for the Lecture-Discussion class present. a rather 
perplexing picture. Movement in the direction of pathological 
behavior occurred on the K (Correction), Hs (Hypochondriasis), 
Hy (Hysteria) and Se (Schizophrenia) scales. Strictly interpreted, 
it would seem that this class responded to the lecture-discussion® 
approach of the investigator by tending to move in the direction 
of greater defensiveness, somatic complaints, and withdrawal 
behavior. A highly tentative speculation concerns the dual rôle 
played by the investigator. It is conjectured that the possible W> 
effects of this double identity may have contributed to the genera- 
tion of more than the usual amount of anxiety caused by classroom 
practice and that this was manifested by characteristic college 
student defenses such as increased defensiveness, somatic pre- 
Occupation, and greater intellectualization. 

With the exception of a significant increase in the F (Validity 
seale) for Control Group I there was no significant movement for 
both controls, The high F (Validity scale) would suggest that the 
results for this group may be of questionable validity. 

Table IV presents the class means, deviations, and ranges for 
both groups taught by the investigator, e.g., group therapy and 
lecture-discussion, on both mid- and end-term objective examina- 
tions. Examination of Table IV clearly indicates that while the 
Lecture-Discussion group did slightly better on both mid- and « 
end-term examinations these differences are statistically insignifi- 


Taste IV.—A Comparison ov Content EXAMINATION RESULTS FOR 
Group Tuerary anv Lecrure-Discusston CLASSES 


Mid-Term End-Term 
—— ra a |e 
oo 81.51 | 5.51 | 70-90 80.97 | 7.74 | 70-91 
18 18 
Lecture-Discus- | 81.76 | 4.50 | 68-96 82.40 | 6.47 | 70-93 
sion (25) 


Nore: Using the small sample techniques and entering the “t” table 
With 56 degrees of freedom the ratio is significant at the .05 level if it is " 
Over 2.00 and at the .01 level if it is over 2.65. o 
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cant. It is interesting to note that grade scores remained constant 
from the beginning to the end of the semester. 

The general conclusions drawn from a statistical analysis of the 
data of this study are as follows: 

1) The major purpose of the study was accomplished in the 
finding that a course in practical psychology conducted by using 
such group therapy procedures as student-centered classroom 
practice and sociodrama measurably modified attitudes associated 
with racial, religious, and ethnic prejudice. 

2) A course in practical psychology when conducted along 


=; MM traditional lecture-discussion methods of instruction did not 


significantly modify these same attitudes. 

3) The factor of attendance at Brooklyn College within the 
Vocational Diploma Program for the experimental period did not 
produce significant modification of these same social attitudes. 

4) Although statistically significant changes took place on both 
E and F scales for the experimental group only, statistical analysis 
of the data revealed that the attitude measured by the F scale 
(covert attitudes of prejudice) was modified more significantly. 

5) Inspection of the variable changes comprising both E and F 
scales revealed that the most significantly effected scale clusters 
were the conventionality and the authoritarian agression items. 
Scale variables approaching significance were the authoritarian- 
submission and the superstition and stereotype items. 

6) The data on the Index of Adjustment and Values were 
generally supportive of the attitude-scale findings. Significant 
increases in the Self-Concept (Scale I) and Acceptance of Self 
(Seale IT) took place for the Group-Therapy class only. The dis- 
crepancy index (adjustment index III-I) while moving in the 
direction of greater adjustment did not approach significance. 3 

7) The Minnesota Multiphasic Inventory did not reflect readily 
understandable results. Significant movement took place within 
the Group Therapy class on the Pd (increased) and M (decreased) 
variables. The Lecture-Discussion class shifted significantly upw® 
on the K, Hs, Hy, and Se scales. Movement within the cont 
groups was limited to a significant increase in the F variable for 
Control Group I. It appears that the MMPI when used with # 
Population of ‘normal’ individuals may not be the ideal instrument 
to measure movement towards greater or lesser adjustment within 


~a nari frame of reference. 
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_ 8) A subsidiary finding for the two groups taught by the in- 


vestigator was the practical similarity of course content test 
results on both mid- and end-term examinations. 


IMPLICATIONS 


The results of the present study have rather clearly? demon- 
strated the relative superiority of therapeutic classroom techniqu 

in modifying significantly inter-group prejudice. In doing so it has 
supported the findings of those investigators who believe prejudice 
to be but a symptom of a personality need syndrome, which is not 


readily amenable to modification by factual information, exhorta W 
tion, propaganda or other simple linear approaches. It further 


supports the view of such clinical educators, if such they may be 
called, as Rogers, Cantor, Snygg, Combs, Shedlin and Kilpatrick, 
to mention but a few, who insist that education is only truly 
effective in the degree that the individuality and integrity of the 


student is truly respected. This education is based on the assump- 


tion that the student can be trusted to learn in every way which 
will maintain and enhance self, and he can be trusted to grow, 
provided the atmosphere for growth is available to him. 

It has been indicated by this research that non-directive and 
Sociodramatic techniques have contributed towards a total self- 
development that was reflected in increased self-insight, greater 
self-acceptance and a decrease in attitudes associated with racial, 
religious, and ethnic prejudice. Exemplary of much of this growth 
are several excerpts ‘drawn from course evaluation forms submitted 
by students in the group therapy class: . 

“The most important thing that I have gained is self-confidence 
and a feeling that I am not the only person who feels insecure, 
hesitant, doubtful and quite often frustrated. I have a fuller, more 
rounded outlook on life. I see my friends, relatives and teachers 
as human beings and friends rather than scientists with micro- 
Scopes waiting to study me. I feel, that, now, we are all alike. I 
find that I am unconsciously more sympathetic toward people and 
their troubles, more friendly and can certainly hold a more intelli- 
Gent conversation due to the knowledge I learned this term—and 
it was plenty.” 

“As a result of this course I think I have gained a broader 
outlook on the ‘whys’ and ‘hows’ of the behavior patterns of 
People around me. It has changed my approach to on 
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living because of the technique that was used, especially those of 
the dramas which were used and which depicted the same every- 
day situations most people have gone through.” 
“T have gained quite a bit from this course. I have learned that 
most people don’t do things just to be nasty but because of some- 
thing th&t bothers them. It has taught me to have patience with 
my friends and relatives when they are upset and annoyed, I have 
further learned to control my temper.” 


REFERENCES 


AP »T.w. Adorno, E. Frenkel-Brunswik, D. J. Levinson, and R, W, San- 
ford. The Authoritarian Personality. New York: Harper and Brothers, 1950, 

2) C. Rogers. Client Centered Therapy. Boston: Houghton Mifflin Co., 
1951. 

3) H. S. Maas. “Applying group therapy to classroom practice.” Mental 
Hygiene, 35 (1951), 250-259. 

4) H. H. Jennings. “Sociodrama as an educative process.” Fostering 
Mental Health in our Schools. Washington, D. C., National Education Associ- 
ation, 1950, 

5) J. L. Moreno. “Experimental sociometry and the experimental method 
in science,” in Current Trends in Social Psychology, Pittsburgh: Univ. of 
Pittsburgh Press, 1948, 


ağ 


ATTITUDES OF PARENTS OF DIFFERING 
SOCIO-ECONOMIC STATUS TOWARD THE 
TEACHING PROFESSION’ 
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e 
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The purpose of the present paper is to report on an investigation 
of the relationships between socio-economic status and parental 
attitudes toward the teaching profession. The following a 
reported here: (1) The prestige of teaching occupations; (2) The 
suitability of the teaching occupations for sons and daughters; 
(8) The desirability of the teaching occupations for sons and 
daughters. 

SUBJECTS AND METHODS 

Expressions of attitudes were obtained by asking the parents of 
a selected group of high school pupils to rank three teaching 
occupations (grade school, high school, and college) along with a 
selection of other occupations representing a wide socio-economic 
range. The rank assigned the teaching occupation is accepted as 
a measure of attitude. 

Measuring attitudes toward the teaching profession.—A question- 
naire was constructed to facilitate the investigation of parental 
attitudes toward the teaching profession. In constructing this 
questionnaire the fifst step was to select two lists of occupations, 

„ one for sons and the other for daughters which (1) were cgmmon 
to Towa, (2) had a comparatively large number of workers, and 
(8) ranged from unskilled to professional. For the part of the 
questionnaire dealing with prestige, ninety-three occupations were 
used. For the parts dealing with suitability and desirability, 
forty-one occupations were used for sons and twenty-nine for 
daughters. * 

Thstructions to parents were in part as follows: (for prestige) 
“Some occupations seem more important than others. Perhaps 

of us respect some occupations, and the men and women in 
them, more than we do others. Place a figure 1 in front of the 
—_—_—+_ 


1 The article is based on the writer’s Doctorate dissertation, State Uni- 
versity of Towa, 1952. Grateful acknowledgment is expressed to Professor ó 
James B. Stroud under whose direction the work was done. ® 
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occupations you have the highest respect for... . Place a figure 5 
in front of the occupation you have very low respect for. You 
may assign the same rating to as many occupations as you like.” 

The following are the instructions employed for suitability: 

“In this part of the questionnaire we are asking you to think 
about eich occupation and to judge how well it is suited to your 

son (or daughter). . . . In light of your son’s (or daughter’s) abilities 
and qualities which occupations is he (or she) best suited for? 
Please remember that he may be well suited to more than one 
occupation.” Place a figure (1) in front of the occupations which 
aresvery highly suited to your son’s (or daughter’s) abilities and 
qualities. .. . Place a figure (5) in front of the occupations which 
are very poorly suited to your son’s (or daughter’s) abilities and 
qualities, 

Parents were asked to rate the same lists of occupations (for 
sons and daughters) as to desirability again on a five-point scale. 
Here parents were asked to make the ratings in terms of how well 
they would like to see their son or daughter enter them. 

Characteristics of the sample—The questionnaire was adminis- 
tered to parents of certain of the six hundred and sixty-six tenth- 
grade students in the four public secondary schools of Cedar 
Rapids, Iowa, inthe spring of 1952. 7 

By use of the occupation of the wage earning parent, which A 
most cases was the father, and the address, an estimate of the social 
status of each pupil was made. This amounted to a preliminary 
screening. Each pupil was placed in one of four lists based on 
this initial social class estimation: upper, upper-middle, lov 
middle, and lower. From each list a randomly drawn tentative 
sample was selected. To this sample the Index of Social Class 
was applied. When necessary adjustments were made, the 


sample appeared as follows: . 
Daughter Bs 
Father Mother 
| 
Upper-class 15 15 60 
Upper-middle 15 18:-- poi 
Lower-middle 1 | 15 a0 
Lower-class 15 15 60 


|__—_ 
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"In ninety-five families both parents coéperated, while in fifty 
‘families only one ‘parent filled out the questionnaire. 
Measuring the socio-economic status.—The method selected for 
‘determining socio-economic status was the Index of Status Char- 
acteristics, 1S.C., developed by Warner, Meeker and Eells. The 
_ formula employed for this estimation of social status is a Weighted 
combination of values assigned to (a) Occupation, (b) Source of 
‘income, (c) House type and (d) Dwelling area. 
Each parent selected in the tentative sample was personally 
contacted by the writer. This contacting interview had as its 
aim the establishing of sufficient rapport to bring about acceptance 
of the questionnaire. During the interview, information needed 
‘to complete the 1.S.C. was gained. Where needed, corrections 
“Were made as indicated above. A date was set for personally 
picking up the completed questionnaires. If upon returning to 
parent the questionnaire was not completed a second date was 
__ set for pick-up purposes. 
j K Coöperation was furthered by the assistance of the superinten- 
‘dent of schools who wrote a letter of introduction which was 
ra shown to the parents by the interviewer. In these ways, codpera- 
tion was obtained from ninety-two per cent of the families con- 
— tacted. , 

y RESULTS 

Prestige of teaching occupations.—The mean ratings on prestige 
for the three teaching occupations are presented in Table I. To 

à facilitate interpretation the mean ratings of several other occupa- 
tions varying widely in prestige are given. In both instances 
Separate means are listed for the four social classes. The data 
are for sons only. 

Noteworthy is the fact that ratings of occupations do not vary 
much with social class. This finding stood out in the entire list 
(ninety-three) of occupations, suggesting that the various occupa- 
tions are held in about the same degree of esteem by parents of 

the four social class levels. Mickey obtained a similar result 
When he asked ninth-grade students to rate occupations as to 
prestige.’ 

1 

*W. L. Warner, M. Meeker and K. Eells, Social Class in America, Chi- 


cago: Science Research Associates, 1949, pp. 121-159. l; 7 
E. C. Mickey, High School Student Rating of Occupational reita 
Q 


Master’s thesis, State University of Iowa, 1948. 
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TABLE I.—MEAN PRESTIGE VALUES OF OCCUPATIONS ror Sons BY 
Socran Crass or RATERS ò 


Upper- 
Occupation Upper-class middie aeaa Lower-class 


Sutgeon 

kad College President 
Lawyer 
College Teacher 
Supt. of Schools 
High-school Teacher 


Veterinarian 
Factory Manager 
Rural School Teacher 
Policeman 

Bank Teller 
Lumber Dealer 
Carpenter 
Hardware Dealer 
Mail Carrier 
Plumber 

Traveling Salesman 
Sales Clerk 
Delivery Man 
Waiter 

Bartender 


SCOMRSHRONDUDANSONDOUUWO 
2 


AHKaANdS ROUND RH ADOONNAA YDS 
PE SVS NN BO BO bo DO BO BO Bo EEE Eee 
EREE RERERRETEFETETREETTEES 


H a O O9 Oo BO bO BODIO D D O D DO i DD eee 
W o o oa O O O O O O O DO DO EPH pet ee ee 


Wa oo a O9 9 09 69 O9 O9 OO BD O9 B9 pi pi pei pmi pd pad pe ee pe 
NWMWOPSONROHNOTOCHROBDHOORBESHOS 


Also significant is the comparatively high prestige ratings 
accorded the teaching occupations by all social classes. ‘This 
finding contrasts with the fact that important social class differ- 
ences were found when parents were asked to rate occupations on 
suitability and desirability. 

Suitability of teaching occupations for sons and daughters.—Do 
the views expressed by parents regarding the suitability of teach- 
ing for their sons and daughters vary with socio-economic status? 
The data in Table II show that the two upper classes rated all 
three of the teaching occupations as being more highly suited than 
did the two lower classes. The two lower classes rated teaching 
less well suited to their sons and daughters. In fact, except se 
lower class father’s attitudes (as expressed) toward grade-scho0 
teaching for daughters it can be said that parents of the two lower 


<H Degard teaching as poorly suited to their children. oi 


D 
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TABLE II.—MEANS AND STANDARD DEVIATIONS OF THE RATINGS OF THE 
SUITABILITY or TEACHING OCCUPATIONS FOR Sons AND DAUGHTERS OF 
Dirrerine Socio-Economic STATUS 


College High School Grade School 
Father Mother Father Mother Father Mother 
© 

Us M & (SD) | 2.7 (1.1) | 3.0 (1.4) | 2.3 (1.0) | 2.6 (1.2) | 2.7 (0.8) | 3.1 (1.2) 
U:D M & (SD) | 2.7 (1.1) | 2.3 (1.2) | 2.3 (1.1) | 2.3 (1.2) | 2.7 (0.9) | 2.4 (1.0) 
UM: 8 M & (SD) | 2.8 (1.4) | 2.9 (1.3) | 2.5 (1.2) | 2.5 (1.4) | 2.6 (1.3) | 2.9 (1.3) 
UM:D | M & (SD) | 3.3 (1.2) | 3.4 (1.2) | 2.9 (1.1) | 2.9 (0.9) | 2.5 (1.0) | 2.2 (0.9) 
IM:8 M & (SD) | 3.6 (1.3) | 3.8 (1.2) | 3.3 (1.5) | 3.1 (1.2) | 3.4 (1.5) | 3.1 (1.3) 
IM: D M & (SD) | 3.5 (1.5) | 3.7 (1.4) | 3.1 (1.8) | 3.5 (1.4) | 2.7 (1.0) | 3.3 (1.8) 
Ls M & (SD) | 3.9 (1.2) | 3.9 (0.8) | 3.6 (1.5) | 3.8 (1.2) | 3.7 (1.5) | 3.9 @.9) Ts 
L: D M & (SD) | 3.5 (1.0) | 3.6 (1.4) | 2.9 (1.1) | 3.8 (1.5) | 2.8 (1.1) | 3.2 (1.3) 
Us Upper class parents. 
5 Sons, ete, 


Nore: Due to the instructions employed in the questionnaire the lower the mean the higher 
the attitude of suitability. 


special interest is the high rating given high-school teaching by 
the two upper classes, the second and fourth highest places among 
the forty-one occupations. In the two lower classes, high-school 
teaching was ranked nineteenth and thirty-first with grade-school 
and college teaching ranking even lower. 

Mothers of the several classes differed from fathers in that they 
judged all three teaching occupations a little less suitable for their 
sons—with the exception of the mothers of the lower-middle class. 

As a means of determining the statistical significance of social 
class differences in ratings, and differences among the ratings of 
the three occupations, analyses of variance (f test) were computed. 
| Examination of these data on the fathers’ ratings revealed that 
the differences among the social classes concerning the suitability 
of the three teaching occupations for their sons were statistically 
significant at the five per cent level.! The analysis of variance 
showed that differences in the ratings of the three teaching occupa- 
tions were also significant. 

_ Mothers of the four social classes differed significantly (f test) 
in rating the suitability of the teaching occupations for their sons. 
This analysis also revealed that mothers did not judge the three 
teaching occupations as equally well suited for sons but rated 


—_ 
i “Hereafter, when significant difference is used it refers to a difference r 
that is statistically significant at the .05 level or better. ® 
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high-school teaching as significantly better suited than either 
grade-school or college teaching. è 
Parents with daughters judged the suitability of the three 
teaching occupations differently than did parents with sons. As 
can be seen in Table II, college teaching is strictly an upper class 
occupation; only this social class rated it as well suited for daugh- 
ters. A much different attitude exists about the suitability of 
grade-school teaching; fathers of all four social classes rated this 
teaching occupation above average while mothers of the more 
favored environments rated it highly suitable and mothers of the 


aitwa lower classes rated it slightly below average. 


fi 


With respect to high-school teaching, parents of the upper class 
rated this teaching occupation highly suitable for daughters while 
the upper-middle class rated it somewhat lower but still above 
average. Fathers of the two lower classes rated it average in 
suitability, while mothers judged it somewhat below average. 

Once again mothers differed from fathers in judging the suita- 
bility of the three teaching occupations. Ratings of mothers of 
the upper class were higher than those of fathers, but in the less 
favored environments, the opposite trend was noticed. 

The analysis of variance of the ratings of fathers with daughters 
revealed statistically significant differences in the way the three 
teaching occupations were rated, but non-significant social class 
differences. Analysis of variance of mothers’ ratings revealed 
both statistically significant social class and occupational differ- 
ences, 

Desirability of teaching.—Parents of different socio-economic 
status were asked to think less realistically and to judge just how 
much they would ‘like’ to see their son or daughter enter the 

. occupations previously rated as to suitability. 4 

Means and standard deviations of ratings of the teaching occu- 
pations as to desirability are found in Table III. Since the 
differences between the ratings of mothers and fathers were 8? 
small, the attitudes of parents have been combined. 

The only difference that is significant among the ratings of the 
upper and lower social classes on desirability is that for the cate- 
gory—college teaching. Among the forty-one occupations Ta a 
on desirability for sons college-teaching was ranked the fifth an 
third most desirable by the upper and upper-middle classes 


eee, . Highschool teaching was rated ninth and tenth 
P : 
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VS AND STANDARD DEVIATIONS OF THE DESIRABILITY OF 
OCCUPATIONS ror Sons AND DAUGHTERS OF DIFFERING 
Socio-Economic STATUS 


College High School Grade School 


Son |Daughter| Son |Daughter| Son® |Daughter 


2.8 
1.2 


desirable by these same classes and was judged more desirable 
such professional occupations as newspaper reporter, drafts- 
insurance agent and pharmacist. Although rated somewhat 
grade-school teaching was judged more desirable by these 
classes than all the skilled occupations. This was also true 
lower-middle class with the exception of electrician, carpen- 
machinist, which were judged more desirable than grade- 
teaching. The upper-class judged college-teaching to be 
st desirable for daughters of all twenty-nine occupations 
, with high-school teaching judged more desirable than such 
ional occupations as office manager, registered nurse, social 
stenographer, purchasing agent and newspaper reporter. 
‘upper-middle class judged grade-school teaching to be the 
most desirable for daughters of all occupations rated, with 
chool and college teaching rated only slightly lower. The 
lower classes rated teaching only slightly less desirable for 
hters than did the favored classes, although they rated cer- 
business occupations (stenographer, typist and bookkeeper) 
slightly more desirable than teaching. Other business 
tions were rated considerably less desirable than teaching. ä 
an interesting observation that although a: a 
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come from the lower and lower-middle classes® (probably making 
them about average proportionately with respéct to social class), 
parents of these two classes, as suggested by this investigation, 
appear to regard teaching as a less suitable and less desirable 
occupation for their sons and daughters than do parents in the 


atwo upper classes. 


CONCLUSIONS 


1) In rating occupations as to prestige value the parents of this 
sample showed little difference one social class from another. 

can ratings for the four social classes were remarkably similar 
for the four classes. All three of the teaching occupations dealt 
with in this paper ranked relatively high on the prestige scale. 
Among the ninety-three occupations the composite rank value on 
a five-point scale for college teaching was 1.6 (1.1 being highest); _ 
for high-school teaching, 1.8; and for grade-school teaching, 1.9. 
(Eight occupations ranked higher than college teaching, in this 
order: surgeon, physician, college president, judge, clergyman, 
architect, dentist and electrical engineer.) j 

2) Ratings on suitability of the teaching occupations varied 
with social class status. Parents of the upper two classes rated | 
the teaching profession more suitable for their sons and daughters 
than did parents of the lower two classes. 

3) Ratings on desirability of teaching as an occupation also 
varied with social class status, although to a lesser degree, except 
for college teaching. ; 

4) Ratings on the suitability of the three teaching occupations 
varied as to the sex of the pupil. For all social classes combined, 
parents judged high-school teaching to be significantly more | 
suitable for sons than the other two teaching occupations. The 
upper three classes rated college teaching as being more desirable 
for their sons. With the exception of the upper class, grade- 
school teaching was judged to be better suited to daughters than 
the other two teaching occupations. It was also rated as being 
much more desirable for daughters than for sons. * 

*Nathan A. Edwards, Sociology of Teaching. II: A Study of the Male 
Classroom Teacher, Doctor’s dissertation, State University of Iowa, 1952. 
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COMPARATIVE DECLINE OF GRADUATE 
RECORD EXAMINATION SCORES AND 
INTELLIGENCE WITH AGE 


4 
R. TRAVIS OSBORNE and WILMA B. SANDERS © 


University of Georgia 


From the literature (3, 4, 7, and 8) it is possible to infer that 
adult age increments are accompanied by a decline in acquired 
knowledge which would parallel the decline in intelligence with 
age. However, with the exception of Learned and Wood’s The 
Student and His Knowledge (8) and a previous study by the authors 
(4) there is almost a complete lack of direct data on the compara- 

tive decline of acquired knowledge and intelligence of adults. 
This investigation is for the purpose of securing research evidence 
relevant to this problem. 


POPULATION AND PROCEDURE 


Tn the present study the Graduate Record Examination was 
used for determining the student’s level of acquired knowledge. 
This test according to the G.R.E. prospectus “is the only examina- 
tion of national character which is expressly designed to measure 
achievement in the principal fields of collegiate study at the 
advanced level and which is endorsed or required by graduate and 
Professional schools” (2). Actually the G.R.E. is a series of 
subject-matter tests designed to cover broadly the main areas of 
the typical undergraduate program of instruction: Mathematics, 
Physics, Chemistry, Biological Sciénce, Social Science, Fine Arts, 
Literature, and Verbal Factor. The latter test is not a content 
test in the true sense of the word but it was developed and included 
in the G.R.E. battery to measure the ability to discriminate word 
Meanings, 

The one thousand eight hundred and seven graduate students 
Who were in’attendance at the University of Georgia between 
February, 1946, and February, 1952, were subjects for study. 

group includes all graduate students except some few irregular 
Students who were not seeking an advanced degree. The seven 


Professional schools and the five divisions of the College Sh Arts 


and Sciences were represented. Men students out-n 
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women; one thousand one hundred and seventy- three to six 
hundred and thirty-four. 

G.R.E, scores, personal data, and college marks were available 
from the files of the Dean of the Graduate School. Machine 
methods were used for processing and conventional statistical 

o procedfires were used in analyzing the data. 

Figures 1 and 2 compare graphically G.R.E. age-score and 
intelligence curves. The two sets of data were not obtained from 
the same subjects; the age-achievement curve was plotted from 
the results of G.R.E. and the intelligence curve from Wechsler’s 
ndérmative data, For comparative purposes these curves have 
been reduced to approximately the same scale. 

The number, mean, and standard deviation for the seven profiles _ 
and three G.R.E. averages were determined for each four-year age 
inerement from nineteen to fifty-five (Table I). Also, not shown 
but available on request there are tables of inter age-group confi- 
dence levels of mean differences for the seven achievement profiles. 
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Fra. 2.—Variations in G.R.E. Non-Science Profiles and Intelligence with 


RESULTS 


Examination of Figure 1 shows that for the Sciences adult 
achievement loss with age is strikingly similar to the decline of 
mental ability with age. For Physics and Chemistry the age- 
| score curves almost parallel the declining mental ability curve. 
Maximum performance in Chemistry is maintained slightly 
longer and the decline is somewhat more gradual than for Physics. 
Mean scores in Biology, however, fall off sharply after reaching 
a peak in the nineteen to twenty-two age level. The loss here 
is greater than the corresponding loss in intelligence with age. 
Among the sciences, mathematics scores seem to hold up best. 
Significant losses were not noted until after age thirty-eight. 
‘This loss is considerably less than the corresponding loss for these 
_ Ages in intelligence. 
For Social Science there is no significant falling off of mean 
Achievement scores with age. The greatest mean difference is 
only 3.6, and that is found between two adjacent age gipa y 
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TABLE 1 


GRADUATE RECORD EXAMINATION PROFILE TEST SCORES FOR 
UNIVERSITY OF GEORGIA GRADUATE STUDENTS, BY AGE 


PHYSICS CHEMISTRY BIOLOGICAL SCIENCES MATHEMATICS. 
AGE Mean Mean Mean Mean 
Srour NOn ST, Scoe S40. NO, ST. SC.# SaDa Nos STe Stee SaDo Nos Ste Scot Side 


1822 432 50.39 9.68 128 51.40 10.87 132 54,54 9.14 91 49,27 9,88 


23-26 413 51.86 9.19 400 52.05 10.31 43 53.26 9.14 339 51410 toi 
27-30 354 51.33 10.21 326 50,78 354 52.00 9.20 288 5079 14 
31-34 216 50,84 9.41 202 50.25 216 49.59 9.71 164 49,43 {0 
35-38 205 49.52 10.00 175 47.84 204 46.92 9.60 153 51.23 148 | 
33-42 155 46.92 9.68 142 47,66 155 45.13 8.74 18 ATi 9460 
43046 98 47.42 10.81 8847.71 98 45.79 10.57 6B 47.49 Iii 
e @ 54 45.50 P919 53 48.07 S4 4362 9.02 A 47.40 100 
-54 34 42o 8.66 33 44,07 34 40.39 9.25 25 49,95 SiH 
55 up 22 459 9A 21 44.21 5,65 22 41.26 10,00 14 3864 120 
FINE ARTS EL TERATURE a SOCIAL SCIENCE | Geeks AVERAGES 
AR E E TRE Shi FRE in Frye SocaSte 
Grove Nos STe Stee SeO. NOs STe Bce S69. NOs STe Seet See Bios UT. 
CHEM, Fohe TO 
19-22 132 $2.96 132 52,82 9,73 132 $0.55 9.74 52412 52.11 Shell 
23-26 410 50,97 413 50,49 9.89 413 50.08 9.43 52,39 50,51 Sill 
27-30 343 43.29 a52 48.82 10.27 354 49.28 10.10 51.39 49.13 SOM 
31934 211 49,31 216 49.07 10.33 216 50.11 10.26 50.23 49.50 Adil 
35438 202 49,21 204 49.66 10.05 203 51.02 11619 48.11 49696 1h 
39-42 i52 48,93 185 50,38 9.12 155 49.67 9.74 46.54 49,66 al 
43-46 9 51.73 98 51.62 10.44 97 $1.38 10.41 46,95 5150 TE 
47-50 53 44.88 S4 47.9} 8,08 S4 47.74 8,3 46,28 46,86 do 
51-54 33 48.92 34 47.56 8.90 3448.34 8,45 -42,14 48T TE 
55 up 22 52.43 22 54,27 9,89 22 50.51 9,59 42,32 52,40 fhe 
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the forties, suggesting the probability of sampling errors or factors 
other than significant age-score trends (Table I). The smoothed 
Social Science age-score curve does not follow the mental ability 
curve but forms a line almost parallel to the base through the 
group mean (Figure 2). 

Fine Arts and Literature means not only hold up with age but 
also some statistically significant mean differences favor the older 
student (Table I). Only from about twenty to twenty-five do 
these smoothed age-score curves parallel the intelligence curv? 
(Figure 2). As was the case for Social Scienc®, the Literature and 
Fine Arts age-score curves approximate lines parallel to the base 
through the group mean. There is almost an imperceptible 
tendency for these three tests to become elevated in the forty t0 
forty-five age range, level off, and then rise again in the ove" 
fifty-five group. 
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The primary purpose of this study was to compare the decline 
with age of acquired knowledge and intelligence. Previous in- 
vestigations have implied that there could be expected a gradual 
decline in achievement test scores after about age twenty-four to 


twenty-five, the age according to Wechsler at which “our 


intel-@ 


lectual capacities are definitely on the decline” (8). However, in 
so far as the present writers have been able to determine, no 
previous attempt has been made to compare the rates of decline 
in adult achievement and intelligence with age, Peterson Side 
report in 1943 that “for each age there is a trend toward a lowered 
G.R.E. average score with increasing age” (6), but his findings 
were based on only thirty-two cases in the over twenty-nine 
age group and he made no attempt to compare intelligence and 
achievement decrements. By averaging G.R.E. profile scores 
the differential rates of decline were lost and the slope of the age- 


score curve was attenuated. 


There are several possible interpretations of the results presented 
above. For example, Peterson attributes the decline in G.R.E. 
Stores with age to “the fact one loses considerable facility or 
_ becomes ‘rusty’ in a subject for which one has little or no oppor- 


- tunity for practice while one loses little in a function which is 


Constantly practiced” (6). On the other hand if the G.R.E. is 
just a long test of verbal intelligence (6) it would naturally be 
expected to show the characteristic decline of mental ability with 


age, 


Although research evidence ‘is lacking on the point, Gilbert: 
believes “motivation plays an important part in intellectual 
decline with advancing age, for when motivation is absent, we find 


Waning interest in things and a decline in intellectual ac 


tivity 


and use of intellectual abilities. The result of this disuse, of 
Course, is accelerated decline” (1). Still other explanations may 
occur to the reader;.such as, for example, the inadequate sampling 
techniques inherent in cross-sectional studies. The fact that if 
the older age groups are found to be heavily weighted with women 
and/or Social Science, Literature, or Fine Arts majors would 


account for the sustained mean scores in these areas. 


The results presented in the previous section would appear to 


Support the hypothesis of a differential rather than a cog 
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rate of decline of acquired knowledge with age. Only the science 
profiles of the G.R.E. follow the characteristic curve of mental 
decline with age. Social Sciences, Fine Arts, and Literature 
scores seem to hold up well even at the extremes of the age range. 
As the rates of decline are largely functions of the nature and 

.,content*of the tests used these findings should not necessarily be 
expected to hold for different tests, other age groups, or for any 
individual student. 
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THE SOPHOMORE GENERAL CULTURE 
TESTS AND STUDENT KNOWLEDGE 
OF CULTURAL FACTS 


BERT D. ANDERSON ° 
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Southern State College, Magnolia, Ark. 


The problem of appraising the college student’s knowledge of 
cultural facts has received a good deal of attention in recent years. 
Special interest in this matter is justified since edutators generally es 
agree that American colleges have the responsibility of acquainting 
students with scientific and cultural information that is felt to be 
basic in perpetuating our society. Various programs of general 

= education have been devised to improve the student’s knowledge 

_ Of his heritage and the responsibilities he must accept if he is to 
become a worthy citizen in these critical times. In all of these 
Programs it has been assumed that if the student possesses a 
certain body of general information he is more likely to become 
an effective citizen in our democracy. 

Perhaps the Sophomore General Culture Test in its various 
forms has been more widely used than any other single instrument 
to estimate the extent to which the college student has acquired a 
knowledge of cultural facts that are assumed to be significant in 
our society.’ It is common practice to administer this test at the 
end of the sophomore year when the student has completed most 
of his general education training. 

The major research pertaining to this instrument has been 

= Conducted on a national basis of the Educational Testing Service 
through its national college testing program. Their findings have 
enabled individual colleges to compare the performance of their 
students on this test with that of a national college group. They 
have demonstrated that there is a wide variation in scores made 
Within and between colleges on the Sophomore General Culture 
eee However, for the local administrator of the general 


‘One hundred seventy colleges representing thirty-seven states partici- 
Pated in the 1950 College Sophomore Testing Program. À 
* Committee on Educational Testing, “Report of the sophomore testing 
Program,” Educational Record, xiv, 1933, pp. 522-71. © 
* The 1932 College Sophomore Testing Program, “A report bygphe Ad- 
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education program and the counseling staff these very practical 
questions arise: (1) How comprehensive is thé coverage of this 
test when its items are related to course offerings found within the 
general education program? (2) How should the student's per- 
formance on this test be interpreted when related to the instruc- 
tion he®has received in the local general education program? 
(3) What do these scores tell us about the student’s knowledge of 
cultural facts? 

A partial answer to the above questions was found at the Uni- 
versity of Utah by (a) tabulating the Sophomore General Culture 

‘est: items acéording to the courses to which they were related 
that were offered within the general education curriculum and 
(b) ascertaining the relationship between scores received on the 
General Culture Test and the student’s past achievement in four 
separate areas of general education. 


SAMPLE 


In the spring of 1950 the Sophomore General Culture Test, Form 
Y, was administered to eight hundred and forty-nine students 
who were completing their sixth quarter of work at the University 
of Utah. They had finished most of their general education 
classwork requirements in the Physical Sciences, the Biological 
Sciences, the Humanities, and the Social Sciences. Two hundred 
students, whose scores and stated professional aims were found to 
be representative of the total group, were selected for more inten- 
sive study and used as the sample for this research. 


FINDINGS 


I) Test items coverage of curriculum content—Students were 
required to complete at least nine credit hours in each of four 
general education areas. These were the Biological Sciences, the 
Physical Sciences, the Social Sciences and the Humanities. The 
number of courses made available to the student varied from ten 
in the Biological Sciences and the Physical Sciences to thirty- 
pila eee enn ee CIEN COS IY 
visory Committee on College Testing,” Educational Record, x111 1932, PP: 
290-343. i 

4 “The Report of the 1936 College Sophomore Testing Program,” Be 
cational Record, xvir, 1936, pp. 577-608. ” 

> 5 “Report on the 1950 National College Sophomore Testing Program, 
a Qoöpeive Test Division, Educational Testing Service. pp. 1-4. 
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| Taste I.—NuUmBER or Tust Irems Devorep TO SUBJECT MATTER 
TAUGHT IN THE SOCIAL SCIENCE AREA OF GENERAL EDUCATION 
Beenie As minme] miget | Se PE EY bok St 
Social Science Area o 
Economics 6 1 7 o 
Geography 2 0 a 
History 11 1 48 
Health* 8 0 
Anthropology 1 0 0 
Political Science 7 19 & 4o Se 
Sociology 3 25 0 
Psychology 2 0 4 
Philosophy 2 0 0 
Social Sc. Survey 3 U.C.** UC" 
37 54 74 


* Not offered in this general education area. 
** Unclassified. 


seven in the Social Science areas. The various courses in each of 
the four areas were listed separately so that the items of the 
appropriate Sophomore General Culture Test could be related to _ 
the course content being offered. Some of these findings are 
presented in Table I. 

In general, the test items found within the Current Social 
Problems and History and Social Studies Tests provided inade- 
quate coverage of course subject matter being offered in the 
Social Science area. While there were forty-nine History ques- 
tions contained in these two tests, it should also be noted that 
there were eleven separate History courses offered. There were 
No questions pertaining directly to Philosophy or Anthropology, 
yet these two courses were regularly being taken by students 
fulfilling the general education requirement in this area. : 

In the other three areas, as was the case in the Social Sciences, 
course offerings were in many instances covered by too few items 
or none at all. Only ten courses were offered in the Biological 
Science area as compared to thirty-seven in the Social Sciences; 
however, only twenty-seven items of the Science Test were directly 
Telated to Biological Science courses while the remaining, thirty- 
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three items were designed to cover subject matter taught in ten 
courses offered in the Physical Science area. In specific subject 
matter situations the lack of coverage was also noted. For 
instance, the Literature and Fine Arts Tests, each containing 
ninety items, could best be applied to the courses offered in the 
Humanities area, A review of these items revealed that there 
Swere no questions asked in either test about such famous philoso- 
phers as Berkely, James, Bergson, Russell or Nietzsche, yet the 
Philosophy offerings within the Humanities area stressed the 
works of these men. The General Culture Mathematics Test, 
PP while perhapsimportant from other standpoints, bore little or no 
direct relationship to subject matter offered in any of the four 
general education areas, 

II) Achievement in General Education related to scores received on 
the General Culture Test—To discover the relationship between 
the student’s past achievement in courses completed in the Bio- 
logical Sciences, the Physical Sciences, the Humanities, and the 
Social Sciences, separate grade point averages were computed for 
each area and used as the criterion of achievement. These grade 
point averages were then correlated with the scores he had received 
on the General Culture subtests. The Pearson Product Moment 
correlation method was used to establish these relationships. 
The results are reported in Table II. 

Each subtest, as well as the total scores of the Sophomore 
General Culture Test, correlated positively with achievement in all 


Tasty Il.—Tum CORRELATION Conrricrents FOUND BETWEEN THE 
GRADE-POINT Averages IN EACH OF THE GENERAL EDUCATION 
ÅRBAS AND ScoRES RECEIVED ON THE SOPHOMORE GENERAL 


CULTURE SUBTESTS 
Soano a an D y aA 
E Tot. 
General Education Area gare Esto Lit. | Science moe Math. Oat, 
gel A Re is 
Biological Science Grade | .374 | 366 | .311 | .507 | .320 | .357 | -506 
Point Average 
Physical Science Grade | .211 | .315 | -288 | .526 | .147 | .429 | -42 
Point Average 
Social Science Grade Point | .344 | .333 | .374 | .356 | .280 | .254 | 444 
Average ; 
Humanities Grade Point | .146 | .204 | .282 | .209 | .262 | .076 | -38 
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‘four general education areas. With few exceptions,. these rela- 
tionships were found to be significant at either the one or five per 
cent level of confidence. Only the Science Test and the Total 
General Culture scores correlated well with the grades earned in 
the Physical Sciences and the Biological Sciences. The total 
score was found to be the best indicator of the studerft’s past 
achievement in the Social Sciences while none of the General 
Culture subtest scores were closely related to achievement in the 
Humanities. The total score was the most consistent index of 
past achievement in general education even though there was a 
tather wide range in the size of these correlation tdeficients. o Sums 


DISCUSSION 


Mhe breadth of subject matter covered within the general 
education program presents an almost overwhelming problem 
when an attempt is made to measure student knowledge in this 

area, With so many courses being offered in each of the general 
education areas, it is only reasonable to suspect that many differ- 
ent course combinations will be presented by students in fulfilling 
_ the general education requirements. The difficulties involved in 
constructing an instrument that will adequately cover all of these 
subject matter areas becomes clearly recognized. With only a 
few items to cover several complete courses in some cases, perhaps 
it was not at all surprising to find low correlation coefficients 
between the grade-point averages earned in each of the four areas 
and the student’s test performance. 

The lack of relationship between test scores and achievement 
and the very limited coverage of test items leads one to suspect 
that the student’s General Culture scores will not necessarily 
indicate how effective the local general education instruction has 
been in improving the student’s knowledge of cultural facts. On 
the other hand, neither does the grade-point average earned in 
each of these four areas give a well-rounded picture of student 
knowledge of cultural facts, since it is based upon a limited per- 
formance in two or three courses in a broad subject matter area. 
For instance, the student might complete the Humanities require- 
Ment without ever having completed a course in Literature. 
Sea it is at this point that the General cre ninia 
Test can serve a useful purpose. If, in such a case, the counselor 4 
is willing to assume that the student’s performance on HD erg 
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ture test represents a rather independent but objective index of 
the student’s knowledge of literature, then these scores can be 
used to supplement grades in appraising the student’s knowledge 
of cultural facts in the Humanities area. If these General Culture 
Test results are regarded more or less as an independent index of 
culturaleknowledge then their validity must be defended upon the 
®premise that the test is not only technically sound, but is based 
upon subject matter that is educationally significant. The lack 
of relationship between General Culture test scores and past 
achievement in General Education courses does tend to support 


Phe» hypothesi that these results do provide an independent 


meastire of student knowledge in these general education areas. 
It seems quite clear that no single measure will give a well-rounded 
picture of the student’s knowledge of cultural facts. 


CONCLUSIONS 


1) The General Culture test does not provide adequate test 
item coverage when a number of elective courses are made availa- 
ble to students completing their general education requirements. 

2) TheGeneral Culture test results do not indicate how effective 
the local genërål education instruction has been in improving the 
student’s knowledge of cultural facts. 

3) The General Culture test results may be used as an inde- 
pendent index of the student’s knowledge of cultural facts if it can 
be assumed that the test is technically sound and based upon 
subject matter that is educationally significant. 
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FINAL EXAMINATION PERFORMANCE IN 
TRADITIONAL AND CROSS-DISCIPLINARY 
COURSES 


IRWIN A. BERG and RAYMOND J, MURPHY! è 
a 
Northwestern University 


While the number of interdepartmental or cross-disciplinary 
courses appears to be increasing among American colleges and uni- 
versities (2, 3), therisein popularity of such courses HS been accom Was 
panied by pointed questions as to their academic value when com- 
pared to the traditional, single-subject courses. For beginning 
college students, the traditional course is regarded by many as 
more ‘solid’ and as providing a firmer basis for mastery of the sub- 
ject matter, while the cross-disciplinary courses are thought to 
encourage superficial and popularized treatment of several fields. 
This attitude, understandably, has been indignantly challenged by 
proponents of inter-departmental course offerings. Curiously, noth- 
ing seems to have been published which offers factual data bearing 
upon the problem. Ty aa 

Tn 1949, with the support of a grant from the Carnegie Founda- 
tion, a cross-disciplinary course known as An Introduction to the 
Sciences of Human Behavior was jointly offered by the Depart- 
ments of Anthropology, Psychology, and Sociology of Northwest- 
ern University. The course was not intended to be merely a survey 
of these fields. Rather, the aim was explicitly one of providing the 
students enrolled in the course with the same concepts, the same 
fund of information had they taken the three introductory courses 
in anthropology, psychology, and sociology. Thus the inter-disci- 
plinary course was intended to serve as a prerequisite for advanced 
Work in any or all of the three fields in the same way as the tradi- 
tional courses. But more than that, by bringing the fruits of scien- 
tific research and the explanatory concepts of the three disciplines 
, t0 bear upon a general aspect of human behavior, it was expected 
that the students who completed the interdisciplinary course would 
achieve a broader and firmer grasp of the subject matter than stu- 


‘The authors wish to express their thanks to Professor Kimball Young 
for his critical reading of the manuscript. @ 
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dents who took the three traditional introductory courses sepa- 
rately. 

The present study attempts an answer to the question of whether 
students who learned their elementary psychology in the cross- 
disciplinary Sciences of Human Behavior have mastered the sub- 
ject matter as well as those who completed the regular or traditional 

beginning psychology courses. While the human behavior students 
were exclusively freshmen and the psychology content of their 
course was spread over three academic quarters, it was nevertheless 
believed that a reasonable comparison could be made of the two 


iP eroups even vavugh the traditional elementary psychology course 


J 
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students were sophomores and above and had their psychology ina 
one-quarter course. The basis for comparison was sixty multiple- 
choice test items which were administered to both the human be- 
havior and traditional psychology classes by including these items 
in the final examinations during the academic year 1951-52. 
Twenty items were administered in each of the fall, winter, and 
spring quarters to a total of three hundred and eighty-four tra- 
ditional psychology course students and to approximately: three 
hundred human behavior students. Although the sixty test items 
sampled the “entire range of subject matter for elementary psy- 
chology, only twenty items representing about one-third of the 
total material could be given each quarter. This division was neces- 
sary because the human behavior students have the psychology 
content of the course distributed over the entire academic yea, 
while the traditional psychology students complete their course in 
one quarter. The total hours spent in class and the textbook assign-, 
ments were the same for both groups. The test items were selec 
from the reading assignments required of both groups. Agreement 
among the human behavior and regular psychology instructors was 
used in choosing the items for inclusion in the final examinations. 
The test items which were identical for both groups were placed 
in a block at the beginning of each examination. The mean score 
for each examination was computed for both groups and for ene 
academic quarter, and the significance of the mean score differ- 
ence tested by determining critical ratios. A 
Because a variety of entrance tests are used by the University $ 
Office of Admissions, it was possible to secure scholastic aptitude 
test scores for only one hundred and eighteen human behavior 
agud and forty-one traditional psychology students who 
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: taken the same form of Northwestern Analogies Test (1). A check 
of the general academic performance of these students for whom 
Analogies scores were available indicated that the groups could be 
regarded as roughly equivalent. Comparisons were also made of 
the identical item scores for students from the various colleges of 
the University. This was regarded as necessary because thë human 
behavior course draws its enrollment chiefly from the College of 
Liberal Arts while the traditional psychology course includes stu- 
dents from virtually every school on the campus. Since the propor- 
tion of male to female students in both courses was about the same 
no control over this factor was deemed necessary. ey . T 


k 


DISCUSSION 


: Viewed as a pattern for the year, the performance of the human 
behavior and regular psychology groups may be regarded as reason- 
ably comparable. As reference to Table 1 will show, the traditional 
psychology group earned a significantly higher mean score on the 
final examination items for the fall quarter; however, the human 
behavior group performance was slightly higher, though not signifi- 
cantly, on those items given to both groups for the winter and 
spring quarter examinations. 
Two factors probably account for much of the difference in the 
Mean examination scores of the two groups for the fall quarter. 
~ First, the course plan of the human behavior group requires that 
4 considerable portion of the fall quarter be devoted to an elabora- 
tion of the terms and concepts used in anthropology, sociology, and 
«psychology. While this time is well spent, since it provides a solid 
| foundation for the rest of the year’s work, it does mean that the 
A A 
TABLE 1,—FINAL EXAMINATION PERFORMANCE OF TRADITIONAL AND 
Human Besavior GROUPS 


Fall Quarter Winter Quarter Spring Quarter 

Course A ae wets) ES 
. ne | m | sD |CR|N*| M | SD cr| N° | M|SD bead 
_ Traditional 150/13.3| 2.8 114111.9) 3.2} | 120113.9] 2.6 ; 
9 i 


3.9) : 
Human Behavior | 300)12.2| 2.9 265/12.2| 2.5 260/14. 1) 2.6) 


tudents represents different 


“The number of traditional psychology 5 € 
classes during each quarter. The human behavior group varies ga 
use of absences from the final examination. a ms 
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cross-disciplinary students have a temporarily leaner academic fare 
for the psychology portion of the fall quarter when compared to 
the regular psychology course. The second and perhaps more im- 
portant factor is that of academic experience. Being younger and 
completely new to the college situation, the human behavior stu- 
dents sould be most handicapped during the fall quarter when 
compared to the older and academically more experienced tradi- 
tional psychology students. This advantage should be reduced in 
the winter and spring quarters as the human behavior group im- 


roved in study skills, became familiar with multiple-choice tests, 
Pg 


tor Such was fttually the case as shown in Table 1. 

Thus far our data indicate that the traditional psychology and 
the human behavior groups were, from an over-all standpoint, per- 
forming about equally on the sixty psychology examination items 
administered during the year. However, while total final examina- 
tion performance was approximately the same for both groups, it 
is emphasized that the interdisciplinary students were performing 
as freshmen about as well as the regular psychology students were 
performing as sophomores or above. While the two groups were 
quite comparable in over-all performance on the psychology test 
items, it cannot be argued that the human behavior students would 
necessarily have benefited even more had they been sophomores, 
as were the regular psychology students. True, the interdisciplinary 
students, though freshmen, demonstrated a level of mastery of 
psychology which was comparable to sophomores. However, it has 
not been shown that the regular psychology students would not 
have done just as well had they taken the course in the freshman, 
instead of the sophomore, year. Furthermore, although many o 
the weaker students leave college during the freshman year, it 18 
possible that motivation for topflight performance may decreas? 
among the more capable students in the sophomore year, once they 
have tested their academic mettle as freshmen and found it satis- 
factory. The freshmen in the human behavior course learned a8 
much psychology as the sophomores in the traditional psychology 
course, according to the data presented; and that is about all that 
can be said in the light of available information. 

A question which naturally arises is whether one group W4% 
brighter than the other. Since scholastic aptitude test scores pe 
available for only a limited number in both groups, no very positive 


` Seige can be made about the influence of this factor. As show? 
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1952 


Course N M sD CR 


Traditional 41 87.1 20.6 
a0 


Human Behavior 118 91.3 22.2 © 


in Table 2, the evidence we have indicates that there was no sig- 
nificant difference between the two groups. The grade point aver- 
age for all courses of the forty-one regular ma a studertts ms 
was almost exactly at the mean of the grade point average for the 
total sophomore class. The mean Analogies test score for these 
forty-one students was 87.1, while the mean Analogies score for 
all students who took the test at the same time was 86.7, Thus it 
appears that the regular psychology group sample was probably a 
representative one in terms of performance on the test and for 
_ grades earned. The one hundred and eighteen interdisciplinary 
_ freshmen for whom Analogies scores were available earned a grade 
point average which was somewhat above that of the freshman 
class, The mean Analogies score for these one hundred and eighteen 
freshmen was 91.3, while a mean score of 86.4 was earned by all 
entering freshmen. It is suggested that these data strengthen the 
belief that the human behavior and traditional psychology students 
Were comparable in scholastic aptitude since the slight, but non- 
Significant, difference in Analogies test performance seems very 
| likely to be due to a minor sampling bias. That is, the sample of 
| human behavior students had a slightly higher mean scholastic 
aptitude score but they also earned slightly higher grades than the 
sample of the regular psychology students. This is what would be 
| @xpected if the sample was slightly biased in favor of the human 
= behavior group. 
As noted earlier, the regular psychology course draws its enroll- 
ment from nearly every college of the University while the inter- 
disciplinary course draws chiefly from the College of Liberal Arts. 
Tf the students from one college were notably stronger or weaker 
Academically, the mean final examination scores might be influ- 
enced by such a deviant group. Tables 3, 4, and 5 present data k 


bearing upon this point. While the mean psychology finalggxami- , 
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TABLE 2 NORTHWESTERN ANALOGIES Test SCORES FOR SPRING QUARTER 


TABLE 3.—Finat EXAMINATION PERFORMANCE OF TRADITIONAL 
Psyonotocy GROUP ANALYZED By SCHOOLS FOR SPRING 
QUARTER 1952 
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DIFFERENCES AMONG SCHOOLS REPRESENTED IN TRADITIONAL 
PsycHoLtocy FoR SPRING QUARTER 1952 


Taste 4.—CrrtTicaL Ratio MATRIX or FINAL EXAMINATION MEAN SCORE 


Taste 5.—Finat EXAMINATION Mean Score DIrrerences BY 800013 
REPRESENTED IN THE HUMAN BEHAVIOR COURSE FOR SPRING QUARTER 


es S 


nation scores varied somewhat for students from the different a 
leges, none of these differences were of statistical significance | 


i te that differential performance on the psychology 
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Aes ee a five per cent level of confidence. Thus there is no eviden® 
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tion items had any significant influence on the size of the 
group means. 

conclusion, it appears reasonable to state that our evidence 

| that the freshmen in the interdisciplinary course learned as 

E h psychology as the sophomores and above in the traditional 
Course, Whether the interdisciplinary students would haveglemon- 
strated even greater achievement had they been sophomores, or @ 
whether the traditional psychology students would have performed 
as well or better had they taken elementary psychology as fresh- 


not known. No evidence is available which bears on this 
ao" E 


Roe: SUMMARY _ 

A study was made of the final examination performance of three 
hundred and eighty-four students in the regular or traditional psy- 
chology course and approximately three hundred students in the 
_interdisciplinary human behavior course which is offered jointly 
by the departments of anthropology, psychology, and sociology 
at Northwestern University. Sixty test items which sampled the 
A range of material covered in elementary psychology were adminis- 
‘tered to both groups as part of the final examinations during 1951- 
52, Though composed entirely of freshmen, the interdisciplinary 
students in over-all performance did about as well as the traditional 
“course students who were sophomores and above. No evidence was 
. C that the two groups differed significantly in scholastic apti- 
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Heren M. WALKER AnD Josera Luv, Statistical Inference. New 
York: Henry Holt and Co., 1953. 510 p. $6.25. 


At long last we have a textbook in statistics for students of 
education which is non-mathematical, comprehensive, accurate, 
® and up-to-date. Though the title of the Walker and Lev book 
indicates that it may be of interest to researchers in many differ- 
ent fields, the senior author is well-known as a professor of educa- 
tion and virtually all ‘the material in it is written in the context 

of problems if educational research. 

Although it may not be*considered an elementary text, since by 
conventional standards it includes ‘advanced topics’ in statistics, 
it features the intuitive approach to the development of statistical 
concepts, an approach designed for the student not prepared to 
read mathematical literature. The first nine chapters are develop- 
mental and designed to cover the basic concepts of statistical 
inference. They include essential ideas of probability, population 
and sample, sampling distributions, chi-square, inferences con- 
cerning the mean and variance, and analysis of variance. The 
remaining chapters depend upon the theory introduced in the 
first part of the book in applications of statistical theory to such 
problems as correlation and regression, measurement and relia- 
bility, analysis of variance and co-variance. Included is an 
excellent final chapter on the very practical subject of non-para- 
metric methods. An example of much needed modern content 
not hitherto available in books for students in education is 4 
treatment of sample survey design. 

Tn the first three chapters, through the medium of simple nu- 
merical problems, Walker and Lev ‘intuitively’ introduce the 
student to most of the statistical concepts he needs to understand 
statistical inference. This includes an introduction to probability | 
and probability distribution (including standard symbolism for 
expressing probability statements), the notion of parameter versus 
statistic, the binomial distribution, probability density, mathe- 
matical expectation, the normal probability distribution, random- 
ness in sampling, statistical estimation, the power function, 4? 
sampling from a finite population. Later chapters are unusually 

> thorough, accurate, and complete on the subjects they cove" 
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The usefulness of the publication is enhanced by generally good 
illustrations and examples throughout, and an exceptionally 
comprehensive set of statistical tables and charts in the appendix. 
Good references are included for students who might be interested 

` jn further pursuit of topics. 

Legitimate comments on this book must be so overwhelmingly 
in its favor that criticisms of it seem trivial. The writer of this 
review found the gains in the adoption of this text in an, advanced 
course in educational statistics to outweigh by far ‘the. difficulties 
encountered with it. For a first edition"there are a negligible 

number of typographical and numerical errors not Pered inch 

errata sheet supplied by the publisher. For instance, a bar is left 
off of an X in equation (9.25), and what should be 77 appears as 
73 in an equation at the bottom of page 214. These are of minor 
consequence in the use of this text. The writer found but one 
example which seemed to distract students from the statistical 
objectives. This appeared in Chapter 9 in an otherwise excellent 
development of the model for analysis of variance. It dealt with 
archery scores, the conceptualization of which seemed about as 
difficult for students to grasp as the concepts they were used to 
illustrate. This is an exception, however, since in general exam- 
ples are realistic and well chosen. 

. The unique order of presentation of materials covered in this 

book may present some difficulties. This will depend upon how 
it is used. The introductory chapters emphasize models, theory, 
and concepts, and progressively and adequately build up a work- 
ing background. However, the writer finds that inexperienced 

students find this somewhat tedious. Even in advanced courses 
education students appear to need an occasional opportunity to 
examine the application of a principle in a number of different 
Ways. In one sense, it is logical to initiate such things as the 
Power concept early in a course in statistical inference. From 
the standpoint of efficiency of instruction, however, this may 
profitably be delayed until after experience has been offered in 
some of the operations of statistical inference which does not 
Tequire it. Related concepts of efficiency and consistency of 
estimates appear early in the text but are not clinched except in a 
Supplementary chapter dealing with percentiles (or ‘inefficient 
statisties). Such matters, of course, are in the category of choices 4 

_ Which usually are expected to be faced by an instructae he 
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adapts to his class and his own objectives. The writer found it 
necessary, for instance, to supplement and emphasize equation 
(9.9) on the partitioning of sums of squares in a simple analysis of 
variance problem, an important basic concept appearing to need 
proof and elaboration. It is only fair to say, however, that there 
are und6ubtedly many alternatives in arranging the sequence of a 
course in statistics and that it is too much to expect a text on this 
level to be self-teaching. “ 

The authors in their preface suggest that some students might 

dy the,text without previous courses in statistics but that those 

teat and quantitative thinking might profit by a 

“slow-paced introductory course”. In most teachers colleges and 
colleges of education this text will probably be most useful in a 
second and/or a third semester course. As an advanced text it 
should be widely accepted. Tt will be found to be an efficient 
vehicle to the understanding of modern statistical methods in 
educational research. The material is there and the exposition is 
precise and clear. Francis G. Cornett 

University of Tilinois 


Winu1aM Srernenson. The Study of Behavior, Q-Technique and 
its Methodology. Chicago: University of Chicago Press, 1954, 
pp. 376. $7.50. 


This book does not, as the title implies, deal with behaviour, at 
least, not in the sense that the term is usually understood by 
psychologists; it deals essentially with techniques of obtaining 
self-ratings and ratings of others, called Q-sorts by Stephenson, 
and with techniques of correlating persons and factor analyzing the 
resulting tables of correlations, called Q-techniques by Stephenson. 
Tt constitutes, therefore, a summary and detailed statement of 
work carried on intermittently by Stephenson since 1935, and well 
known in its main aspects to statistically-minded psychologists. 
Those who have followed this work, and the many controversies 
to which it has given rise, will turn to this book in anticipation of 
finding a detailed discussion of the main points, together with a 
empirical proof of the effectiveness of the methods suggested. 

Of the two innovations which Stephenson claims to have — 

è duced, it may be said that the Q-sort presents a somewhat no 
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l ‘and, under certain circumstances, a probably useful technique of 

 sli-rating. The subject is given a large number of statements 

| to those appearing in orthodox questionnaires; he is then 

‘to rank these in order of applicability to himself, putting 

| "the one that characterizes him best at the top, and the one least 

| like him at the bottom. The relationship between a Qgort and 
A *Yes’-"No’ type of questionnaire appears to be essen-© 

‘the same as that between a product-moment correlation and 

shoric correlation; though both do pretty much the same 

thing, under certain circumstances the one may be prefera- 


ders a Q-sort to be preferable to an ordinary questionnaire, 
is reviewer, at least, found it difficult to see why claims 
for the Q-sort should not be applied in equal measure to the 
despised questionnaire. On the evidence supplied by Stephenson, 

, we can only conclude that the Q-sort, while ingenious, is 
ly to support the elaborate super-structure erected on it by 


ith regard to the technique of correlating persons, we find 
ourselves in the midst of a very stormy controversy. Most 
psychologists have pointed out that the techniques used 

à are not new, that many of the detailed applications 

Are subject to damaging statistical criticism, and that the results 
in where the method can legitimately be applied are no 


onsen, after examining the evidence carefu iy, 
At the critics are right and that anyone wishing to tse 
err bo well advised to Genus the methodological 
pri involved with someone expert in this field. 

does not help his case by showing a somewhat 


tendency to change the meaning of the term Q- 
b over the years, whilst ing that what he now 
by it is what he has meant by it all along. — 


‘ation of his earlier papers and the examples quoted therein is 
fient to show that his views have changed considerably. 


5 € 
a. fA 


the other. Stephenson never discusses in avail why, Ha 
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is no harm, of course, in changing one’s views, but it does make 
controversy exceedingly difficult to follow when, quite justifiable, 
criticisms of earlier work are rebutted by referring to later and 
quite different definitions and interpretations. 

Stephenson has, from the beginning, illustrated his discussions 
by meaas of small-scale experiments, not, presumably, meant to 

®be taken very seriously and serving only illustrative purposes. In 
the present book, one might have hoped for a somewhat longer, 
detailed, and worked-out example of his technique, demonstrating 
recisely how it could be useful in advancing psychological theory 

q “Rnowieate. Stephenson has not taken this opportunity, 
however, but has relied again on a large number of very small and 
often exceedingly trivial examples. It is difficult to judge a 
method in the absence of even a single instance where it has been 
applied to its best advantage. However, on its present showing 
it seems difficult to take very seriously the somewhat extravagant 
claims which Stephenson makes for it. The reviewer is willing to 
reverse his judgment if a convincing example could be given to 
demonstrate the usefulness of Q-technique. In view of the failure 
over a period of twenty years to provide such an example, however, 
it seems unlikely that this judgment will be reversed. 

One last feature of the book calls for comment; namely, its 
style. A great deal of the argument is difficult to follow because 
of Stephenson’s somewhat explosive and disjointed manner of 
writing. There is little in the way of sustained argument, but a 
constant switching of discussion from statistical detail to philo- 
sophical speculation, to attack on critics, to illustrative example, 
and back again to statistical detail. Distrusting his own reaction 
to this, the reviewer handed the book to several statisticians who 
confirmed his opinion that, from the statistical point of view at 
least, it was impossible to find any concise, consecutive, and 
properly documented demonstration of Stephenson’s main claims. 

In summary, there is much that is ingenious, interesting, and 
suggestive in this book, but it cannot be recommended to anyone 
not thoroughly familiar with the field and able to discount the 
more unorthodox and less acceptable parts of the argument. 

H. J. EYSENCK 

Maudsley Hospital, London 
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Drug Addiction among Adolescents. Sponsored by the Committee 
on Public Health Relations of the New York Academy of 
Medicine with the Assistance of The Josiah Macy, Jr. Foun- 
dation. New York. The Blakiston Co., 1953, pp. 320. 


There was widespread alarm concerning the growth iff the use, 
of drug addiction in adolescents in the early fifties. In New York a 
number of committees were formed to better combat this evil. One 
of them was sponsored by the Welfare Council of New York City. 
A committee of fifty-eight decided to call a conference for ihep 
pose of obtaining the thinking of authorities in medical, psychisiric, 
sociologic, educational, and law-enforcement fields. This was done 
with the help of the Josiah Macy, Jr. Foundation. This volume re- 
ports two sets of such conferences: a one-day conference held at 
the New York Academy of Medicine on November 30, 1951, re- 
ported in the first section of the book. The subcommittee held the 
second conference which lasted for two days without any prepared 
papers except a listing of subjects for discussion to be presented to 
the group. This was held at the New York Academy of Medicine 
on March 13 and 14, 1952, and the supposedly spontaneous discus- 
oe was recorded and is presented in the second section of the 

k. 

One of the conclusions that came out of the first conference was 
a more alive awareness on the part of the participants that drug 
addiction among adolescents is primarily a sociological, psychologi- 
cal, and economic problem and only, secondarily, a medical one. 
There were some participants who emphasized the abnormal psy- 
chology or medical slant. But, generally, a realization that only a 
multiprofessional approach was at all promising seemed to hold 
sway. Almost every phase of the problem was considered; not in any 
thorough going fashion, however. Some of the problems concerned 
With were the need for elimination of concepts of moral depravity 
that doctors still have; the comparisons based on reminiscences of 
some of the participants with factors which were realized thirty 
Years ago; the reason for the increase in adolescents’ use of drugs; 
how the craving could be prevented; how the supply can be 
stopped; how to rehabilitate those who have been addicted; the 
Tle of education; what can be done in an institution; and what 
can be done in other directions including the education of parents. 
All of these and other related problems are considered 
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conferences. One of the direct results of these two conferences was 
the inauguration of a project to correlate and integrate data on 
adolescent drug addiction gathered by the magistrates’ courts of 
the city of New York. With the assistance of the United States 
Public Health Service, the project is now being carried on in New 
York University. 

A small agenda committee might have prepared the material so 
that more alive and present-day efforts would be considered from 
the point of view of action for research and practice. But the Macy 

tion deserves congratulation on arranging for this confer- 
th in the publication of the materials of it. People 
in the field will not find this hard reading; they will find it segmented 
and piece-meal, however. Short comprehensive summaries would 
have helped clarify the presentation for the reader. The nearest 
thing to a feature of this kind shows up in the early portion of the 
second conference where Dr. E. H. L. Corwin is invited to sum- 
marize the first conference. For orientation purposes the confer- 
ences evidentally served. The appendix contains a glossary of terms 
used from drug addiction—argot, this is called—and also an in- 
dex. In a short but enlightened introduction by Dr. Frank Fre- 
mont-Smith, Medical Director of the Josiah Macy, Jr. Foundation, 
the purpose and significance of these conferences is expressed more 
coherently than in the other contributions. ‘What has developed, 
says he, “is a much clearer recognition that drug addiction in ado- 
lescents must be seen, not as a problem of moral degeneracy, not 
merely as a failure of law enforcement, but rather as one symptom 
of the serious deprivations suffered by many children living ™ 
large and crowded cities.” H. MELTZER 
Psychological Service Center 
St. Louis, Missouri 


M. E. Bennerr. College and Life. New York: McGraw-Hill Book 
Co., 1952, pp. 457. $4.50. 


The author says that College and Life “deals with the wan 
problems of learning and living imcollege, which each student mus 
face and solve for himself as a democratic citizen in his college 
community in order to achieve the real values of a college on 
tion.” Again, “Vocational guidance and mental hygiene form t! 
p 008 the materiale Tio ae Pook- - 
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ree parts to the book. In the first of these, entitled 
the general college scene is discussed from 
lew. The student is asked to consider seriously 
he is one of those who really should be in college. 
ident may expect to get out of college in return for the 
noney invested is discussed. Such aspects & collegg- 
-curricular activities, customs and traditions of one’s 
dgeting, kinds of majors, trends in college curricula, 
of scholarship, participation in school politics as an 


citizen, and learning self-direct iog a 
The aim is stressed of achieving effective l6cal, 


ternational citizenship. Bennett says “College ex- 
guld provide every student with some excellent training 
ae \oo8e a leader, how to follow a leader, and how to be s 
” These are sound aims, too often neglected in the thinking 
1S. msible for the training of students of all ages, and 
t in the thinking of the students themselves. 
‘0, “Learning in College,” four chapters are given to 
e and conditions of learning and retention, and to effective 
ts. Chapter 8, the discussion of conditions for effective 
ld need implementation by way of reading or class 
The usefulness of the study-conditions and health in- 
| provided in Chapter 9 is limited by the student's probable 
uate criteria for answering. The exercises for improving 
hapter 10 may also lose in effectiveness from the stu- 
y to evaluate his faults and to outline changes in his 
ces. The remaining five chapters of this section touch 
the library, effective silent reading, note-taking, think- 
purpose, and the appraisal of progress in college. Gener- 
hapters contain much of value to the beginning student. 
such exercises as the evaluation of reference materials 
owledge and experience not possessed by many fresh- 
usual freshman would also need a great deal of help with 
inventory, inasmuch as he simply would be unable to 
ne of the judgments required. The chapter on purposeful 
seems quite good, and includes exercises which should be 
‘The chapter on appraisal of progress should help to reduce 
lent’s fear of examinations through understanding of their 
d give him techniques for doing well in them, other things = © 


. 
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Part Three, “Building a Life,”aims at the development of per- 
sonality, achieving mental health, developing a life philosophy, 
and achieving satisfactory adjustments in family life and in demo- 
cratic citizenship. As is the case with the other sections, there is a 
lot here of value for the beginning student, but much of it is pre- 

ented irfrather a shallow manner, and the exercises for the student 
call for more experience and knowledge than he is likely to have. 
It would seem that the demands made upon the instructor, if the 
material is to benefit the student very much, are great. 
peat te aN this reviewer that throughout the book the exercises 
‘and indicated discussions would call for a counselor considerably 
better-trained than the usual ‘mere’ instructor. The somewhat 
elementary level of presentation of much of the material may be 
justified in view of its intended use with beginning college students, 
but adequate use of the material to bring about good adjustment 
and recognition and acceptance of the stated goals on the part of 
the students would require the services of a teacher well-trained 
in clinical psychology or counseling. If such a person were available, 
this book should provide a basis for an excellent orientation course. 
The 405-item bibliography contains excellent and recent references 
for students at all ability levels. Cecu M. FREEBURNE 
Bowling Green (Ohio) State University 


Lrororn BELLAK, Ed. Psychology of Physical IUness. Psychiatry 
Applied to Medicine, Surgery, and the Specialities. New York: 
Grune and Stratton, 1952, pp. 243. 


This is a very little book on a very big subject. Perhaps the 
magnitude of the field justifies the two titles carried by this work, 
but most certainly its contents do not. The subtitle is accurate; 
but the primary title is highly misleading. Of psychology, 98 PY” 
chologists know it, this book contains nothing. The last statement 
must be tempered only by recognition that the one chapter which 
appears to show some semblance of effort at scholarly review : 
existing literature (Chapter 5, on Psychiatric Problems in Genera 
Surgery) has one reference to a paper co-authored by a psy chologist 
and dealing with a scientific, psychological study of determinant 

© of post-surgery recovery. This paper receives no treatment 1n ele 
is simply given super-script reference together with three 
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papers which, like the great bulk of the references used by 
sauthors, are clinical accounts of and superficial hypothe- 


fect. 
‘is a book by M.D’s for M.D.’s. More specifically, it is.a 
of brief papers by psychoanalytically trained ororiented_. 
al specialists, physicians who obviously consider analytic con- 
as both valid and efficient as a basis for 1) investigating, 2) 
derstanding, and 3) controlling the doctor-patient relationship. 
these papers, emphasis is upon understanding Pe er 
matient’s psychological problems primarily within the framewor! 
of the abstractly descriptive terminology of psychoanalysis. Sci- 
fic investigation (as contrasted with clinical accounting) and 
ommendations as to techniques of management are generally 
’ short shrift in preference to relatively stereotyped accounts 
of the ‘meaning to the patient’ of his particular symptom or de- 
fect. Much use is made of the ‘body image’ concept with little 
effort to outline its developmental psychological basis. 
he possible and probable purposes which a book carrying the 
f this one might have, it is difficult to find that any one has 
een pursued. It is very obviously not a comprehensive review of 
" Psychological data or information concerning physical illness or 
handicap in the sense of Pintner, et al., The Psychology of the Physi- 
cally Handicapped, or Barker, et al., Adjustment to Physical Handi- 
and Tilness. As a matter of fact, the goal and effort represented 
in outcome in these two works is considered by the editor of this 
90k as “nearly impossible’. This book is obviously not a report of 
medical and psychological research carried on by a team of investi- 
$ concerned with any particular sample of patients, as exem- 
by Keys, et al., Biology of Human Starvation. It is obviously 
attempt to review the different theories and systematic 
ulations of outstanding psychological and medical experts con- 
ing psychiatric implications of illness. 
‘What of the stated purpose of the book? According to the editor, 
offered “with the hope that we may contribute to a better medi- 
ractice—one which is more enjoyable for the doctor and more 
5 cial to the patient.” The exact way in which this contribution 
0 be achieved is not stated explicitly. Certainly it is clear from 
Content that the contribution is not to be in the form of factual € 
tormation (e.g., frequency of significant psychiatric com 
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in surgical procedures). Nor is the. contribution to be in terms of 
reasonably detailed recommendations concerning the management 
of the emotionally involved patient (e.g., possible techniques for 
effecting a psychiatric referral of a patient with multivle sclerosis). 
Nor is this book’s contribution to a ‘better medical practice’ to be 
in the fofm of challenging questions and the mapping of unexplored 
areas. This book raises no questions and in no way suggests that 
there are arly areas of ‘psychology’ in medical practice that require 
objective investigation. So far as may be determined from this col- 

s ittle essays, the ‘psychology of physical illness’ is that 
Contradiction in tefms—a finished science. 

What of the actual content? This is an edited collection of fifteen 
papers by sixteen authors. The medical areas covered include gen- 
eral practice, internal medicine, malignancy, general surgery, plas- 
tic surgery, gynecology and obstetrics, urology, orthopedics, eye, 
ear, nose and throat, neurology, pediatrics, dermatology, dentistry. 
The final chapter is on the “Personality of the Physician as & 
Factor.” In view of the pervasive psychoanalytic orientation of the 
book, one can well imagine the editor’s frustration at having to 
dispense with a chapter on proctology because he was not at- 
quainted with any expert in the rectal field who was “at the same 
time competent in psychiatry.” 

With the exception of the editor, the writers are not nationally 
prominent in either psychology or psychiatry. This may account: 
for the brief biographical sketch which precedes each chapter and 
helps to establish for the reader the special competence of the par- 
ticular writer. The biographical similarity of the writers is most 
striking. All sixteen are residents of New York City. With the ex- 
ception of two private practitioners, the remaining fourteen con- 
tributors have formal affiliation with one or another of five promi 
nent New York medical institutions. Half of them are associated 
with either Mount Sinai Hospital or Columbia University. Seven 
of the authors are formally affiliated with the New York Psycho- 
analytic Society. Only half of the authors record any previous pub- 
lications in the references appended to theif chapters. The chapter 
reference lists which range from six to fifty-eight items in length, 
and average nineteen items, are somewhat expansively term 
‘bibliographies’. 


The general impression left with this reader by both the chapter 
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the biographical data on the writers is that their 
ications consisted of convenient geographical ac- 
o the editor and a happy homogeneity of philosophical 
{ lytic) conviction. The collaboration and editorial 
haye resulted in a work of surprisingly even quality in 
the very marked variation of style, rnani, em 
and thoroughness which is a common criticism of similar 
diums. Unfortunately, the desirable evenness of quality is 
by a failure on the part of the participants to raise them- 
the platitudinous level of observations: i 
feelings’ about his sickness and that it is important for ~ 
to recognize and deal with these feelings. The chapter 
ity, for example, is lush with unqualified staternents about 
“usual patient feels about his mouth,” about tooth decay, 
ntal therapeutics, but there is no indication of even 
on the part of the author of the extensive research 
iure on psychological determinants of individual differences in 
action, ‘Pain’ has but a single reference in the index of this 
id the term ‘limen’ appears not at all. The one chapter which 
above all the others because of its unhackneyed observa- 
subtlety of its insights into the implications of established 
f psychology, and its emphasis upon the importance of the 
information and control of a medical procedure as an 
alleviant, is the chapter on general surgery. Typical of this 
tis his recommendation that operating room personnel should 
of the fact that during the induction of general anesthe- 
ng is the last sense category to be obliterated and that 
d impressions can come by way of the partly narcotized 


8, or whose titles arouse natural expectations as to content, 
bear the tést of such comparisons and expectations. A com- 
n of this little book with a big title against such works as 
tner and Barker indicates differences both of goal and 
ent of the magnitude of the differences between a Sunday 

the French expedition to the top of Annapurna. In view 
trongly critical flavor of this review and the current revival 


burning’ one might ask if this is a dangerous gan € 
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book. It is not. It may be perused safely by medical students, gen- 

eral practitioners, and specialists—even by the physically-ill per- 

son. It is unlikely to add to the knowledge or understanding of 

any of these, but it most definitely will not harm them. It is not 

that good. WILLIAM SCHOFIELD 
® University of Minnesota 
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\CTOR ANALYSIS OF STUDENT RATINGS 
F PSYCHOLOGY INSTRUCTORS ON THE 


PURDUE SCALE Py 


A. W. BENDIG 


University of Pittsburgh 


revious study of the use of student rating scales as diag- 
d evaluative instruments to assess instructional effective- 
the college level it was suggested that ratings from scales 
highly interrelated might be combined to yield a fewer 
f indices of teaching proficiency that would be more 
han single scale ratings (1). Coffman (4) presents a 
point of view, arguing that a multiplicity of single scales 
difficulties in interpretation. In addition, results have 
that a simple summing of ratings from different scales, 
Consideration to varying intercorrelations among pairs of 
n obscure important information. The residual variance 
g in achievement grades in a course in introductory 
ogy after the variability attributable to academic aptitude 
n eliminated was shown to be significantly related to 
‘Single scales on the Miami Instructor Rating Sheet and to 
ted to others. A simple combining of the ratings resulted 
in the relationship between student ratings and residual 
ent (7). 

ethods could be used to combine ratings from different 
One would be to combine scales that rationally appear to 
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tionships between course achievement in introductory psychology 
and ratings of the course and of the instructor (2). A better method 
of combining scales would stem from a factor analysis of the 
intercorrelations of the scales. Such a procedure should lead to a 
minimum number of factorially pure and possibly orthogonal 
easures of teaching effectiveness as seen by the student. The 
present paper is a report of the results of such an approach to 
combining scales on the Purdue Rating Scale for Instruction. 


iNet PROCEDURE 
cart data were the mean ratings of eleven introductory 


g? 


| 


psychology instructors on the first ten scales of the Purdue Rating 
Scale for Instruction. These instructors had been rated by their 
students during the Spring and Fall semesters of 1951. The number 
of student raters varied from seventeen to ninety-eight, with the 
mean being 42.8. Detailed information on the norms derived from 
these ratings, including reliabilities of the individual scales, can 
be found in a previous report (1). 

The interrelationships among the ten scales were found by 
computing product-moment correlation coefficients between the 
eleven mean ratings on each scale. These correlations can be 
found in Table 1. This matrix of intercorrelations was factor ana- 
lyzed by the centroid technique outlined by Cattell (3, pp- 51-59) 
and the extraction process replicated four times to stabilize scale 
communalities. On the fourth iteration the mean of the absolute 
differences between estimated and obtained communalities was 


TABLE 1.—ĪNTERCORRELATIONS (Ductmat Pornrs Omrrrep) or MEAN 
Ratines or ELEVEN INTRODUCTORY PSYCHOLOGY INSTRUCTORS ON 
THE PurDUm RATING SCALE FOR INSTRUCTION 


EA 3 4 s 6 7 8 9 n 
1 16 | 2 | 30 | 79 | 63 | 59| 20 | 32 2 
Pi 67 54 43 | 66 17| 83 | 64 7 
3 19 | 16 | 36 | -06| 74 | 70 z 
4 46 78 16 | 43 | 02 6 
5 s3 | si| 33 | 39 | E8 
6 47| 50 | 37 | 7 
7 31 | 09 | 86 
8 53 41 
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GINAL AND Roraren Facrors Founp oN THE PuRDUE 
RATING Scare ror INSTRUCTION 


Original Factors Rotated Factors 


Ir m I pig pieg 

39 33 7| 0| ga 
ORS] -02 | 70| 93 
—64 07 o | s| 70 

14 | —58 oo | o0] 69 
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| 
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Extraction of factors was terminated after three factors 
extracted, since an inspection of the residuals plus the 
of the criteria suggested by Vernon (9, p. 130) and 
er (8, p. 298) indicated that the residual variance lacked 
d factor loadings were graphically and analytically 
to achieve orthogonal simple structure. The original and 
ed factor loadings can be found in Table 2. Rotated Factor I 
s to be a general factor with medium or high loadings on all 

scales with the exception of Scale 9. This appears to be a 
o’ factor that is common to most of the P.R.S.I. scales. 
0 factors II and III showed a very clear simple structure. 

r TI had high loadings on the following four P.R.S.I. scales. 


Description 


Presentation of subject matter 0 
Self-reliance and confidence 0. 
Interest in subject 0 
Stimulating intellectual curiosity 0 


e four scales seem to hang together on rational grounds. 


m refer to aspects of the instructors’ behavi 
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in his performance as a lecturer and only as a lecturer. In naming 
this factor (always a subjectively hazardous task), we might 
suggest the appellation of ‘Instructional Competence’. 

Four other P.R.S.I. scales showed high correlations with Factor 
Ill: 


ARAY R 


Loading 


Fairness in grading 0.81 
Personal appearance 0.81 


3 

a 9 
Personal peculiarities 0.71 
92 Sympathetic attitude toward students 0.70 


Again a common thread runs through the scales loaded with 
this factor. The theme appears to be the personality of the in- 
structor as viewed by the student, The personal interaction of 
instructor and student would seem to be important here: what 
impression that the student has of his instructor’s objectivity, 
empathy, and personal characteristics (both structurally and 
functionally). If forced to, we should prefer to call this factor . 

f 
| 
i 


‘Instructor Empathy’. 

One comparable study is the factor analysis of the P.R.S.. 
reported by Smalzreid and Remmers (8). Since this previous study 
analyzed the data from a high-school sample and only extracted 


Tantm 3.—Comparison or Facror Loapinas asove 0.50 Founp 1N Two 
STUDIES or THE PURDUE RATING SCALE FOR INSTRUCTION (DECIMAU 
Ponts OMITTED IN TABLE) 


Smalzried and Remmers Present Study 
P.R.S.I. Scale 
FactorīI |, Factor II Factor |, Factor Uy 

1 76 77 

2 7 70 

3 88 81 

4 73 

5 89 86 ) 

6 69 

T 89 78 

8 72 53 o 

9 79 E s 
) 10 67 72 
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m the matrix of intercorrelations (militating against 
ture solution), it could not be expected that the 
ried and Remmers would be comparable to the 
ts. However, an inspection of their data suggested an 
g comparison. In Table 3 are given the high loadings 
arbitrary figure of 0.50) for the two factors found by 
and Remmers and also the high loadings for rotate 
d II in the present study. A remarkable parellelism 
in the two sets of data. Smalzried and Remmers found 
igh loadings for their factors I and II than did the 
factors II and III. A clue is offered to explain 
from the data in Table 2. P.R.S.I. scales 4 and 6, for 
Izried and Remmers found high loadings and the present 
| not, are the two scales with the highest loadings on our 
factor I. Since this general factor was not separately 
‘by Smalzried and Remmers, its variance is confounded 
riances of the other two important factors and partially 
ey structure. It is suggested that the extraction of a 
r by Smalzried and Remmers would have given, after 
mt rotation to simple structure, a factor structure for the 
‘remarkably similar to the one found here. 


Seale factor loadings in Table 2 can be looked upon as the 
ion of each P.R.S.I. scale with a ‘pure’ measure of a par- 
or. In a sense they are validity coefficients useful in 
ing the criterion of a factor score for an instructor. From 
ings, and the intercorrelations of the scales reported in 
, multiple regression equations for predicting factor scores 
ved by the standard method outlined by Peters and Van 
roothis (6, pp. 225-229). Scales 1, 5, and 7 were found to have a 
multiple correlation of 0.89 with factor II and a similar correlation 
Wasfound for P.R.S.1. scales 3, 8, and 9 in predicting factor III 

‘Both coefficients were reduced to 0.86 by the shrinkage 
en by McNemar (5, p. 161). Table 4 gives the beta (b) 


each of the P.R.S.I. scales, the raw score (B) aii 
ed in 
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TABLE 4.—ReGression WeicuTs Usep IN Prepictine Factor Scores 


Factor IT Factor III 
P.R.S.I. Scale 
b B b B 
1 0.29 0.0377 
> 2 
D> 3 0.31 0.0795 
4 
5 0.38 0.0277 
6 
7 0.31 0.0360 
a 8 0.23 | 0.0397 
8 0.47 | 0.064 
i 10 
Correction —7.79 —16.10 
oo eGA a 
Multiple correlation: 
Obtained 0.89 0.89 
Corrected for shrinkage 0.86 0.86 


noted that what we hereafter shall call ‘factor scores’ are only 
estimates of the score an instructor would receive if we had a pure 
measure of the factor, but that these estimated scores should 
correlate to the extent of 0.86 with ‘true’ factor scores. 

An interesting question raised was whether the objective tests 
that nine of the eleven instructors had taken upon entering gradu- 
ate school could be used in predicting their factor scores. Three 
tests had been used in this entrance battery: (a) the Miller Analo- 
gies Test, (b) a test of mathematical ability, and (c) a reading 
comprehension test containing three subtests measuring vocabu- 
lary, reading speed, and level. The two factor scores estimated for 
each instructor were correlated with these test scores by both 
product-moment and rank-difference (rho) methods. The correla- 
tion of each factor with the entrance tests, plus the intercorrela- 
tions of the factors, can be found in Table 5. The you 
subtest from the reading comprehension test correlated post 
and significantly with both factors II and III, but this result Ta 
investigation in another sample before such a test could possibly 
be used as a selection device for instructors of introductory PY 

> chology. 
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opuct-MoMENT AND RANK-DIFFERENCE CORRELATION Co- 
CIENTS BETWEEN RAW SCORES ON GRADUATE CHOOL 

ANCE TESTS AND ESTIMATED FACTOR SCORES FOR 

NINE Instructors 


Product-Moment Rank-Difference 


0.30 0.19 | 0.29 0.04 
—0.28 0.50 | 0,02 0.40 


45 —0.42 | 0.37 —0,49 


t at the .05 level of confidence. 
cant at the .01 level of confidence. 


DISCUSSION 


“major purpose of this study was to find whether factor 
tic techniques applied to student ratings on the P.R.S.I. 
ide an empirical method of combining separate scales 
a smaller number of reliable and independent measures of 
or effectiveness. For this method to prove successful a 
orthogonal simple structure solution must be found. The 
eported in Tables 2 and 3 lead to the conclusion that these 

nts are met by the factorial structure of the P.R.S.I. 


‘of the P.R.S.I. are accounted for by three orthogonal fac- 
eral factor permeating most of the scales and two group 
‘measuring independent dimensions of the teaching situa- 
A 
be acknowledged that only a relatively small number of 
ors in a single subject matter area were used in the 
, although the number of student raters participating 
e sizeable. However, the similarity in factor structure of 
B.I. in this and in the only other published report sug- 
at the structure found here is neither a function of the 
of the sample of instructors nor of using instructors 
‘single academic department. We should predict that 


Factor II | Factor III | Factor II È Factor To 


0.77* 0.68* | 0.81** 
0.55 0.33 | 0.51 48 
0 


ee 
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another sample would yield slightly different beta weights from 
those reported in Table 4, but the essential factor structure of 
the P.R.S.I. would remain unchanged. 

The similarity of the factor structure of the P.R.S.I. found in 
this study to the structure previously reported by Smalzried and 
Remmers,(8) has already been noted. In addition, Coffman (4) has 
ported a factor analysis of nineteen instructor ratings scales 
and the identification of four factors. Two of these four factors, 
Empathy and Organization, appear to be similar to the two group 

found in our studies. Coffman’s third factor, Academic 
Ti , may be similar to our general factor. 

The practical usefulness of the results lies in the stimulus for 
future research. We feel that we now have two fairly pure, inde- 
pendent, and quantifiable measures of instructor effectiveness 
based upon student ratings whose relationships with other meas- 
ures of teaching aptitude and ability can be investigated. A tenta- 
tive, but suggestive, beginning is reported in Table 5. Future 
research will be concerned with (a) the relation of these factors to 
other criteria of teaching competence, (b) the effect upon these 
criteria of characteristics of the student (age, sex, aptitudes, 
achievement, etc.), and (c) the development of aptitude measures 
useful in predicting a multiple criterion of teaching competence m 
psychology. 


SUMMARY 


The mean ratings of eleven introductory psychology instructors 
on the first ten scales of the Purdue Rating Scale for Instruction 
were intercorrelated and the resulting matrix factor analyzed by 4 
centroid method. Three factors accounted for eighty per cent of 
the variance: a general factor permeating the scales, and tw? 
group factors that were labelled ‘Instructional Competence’ and 
‘Instructor Empathy’. Regression equations were developed to 
estimate factor scores of instructors on these last two dimensions 
and in both cases three P.R.S.I. scales were found to show ® 
multiple correlation of 0.86 with the factors. Nine instructors ha 
taken a battery of graduate school entrance examinations and 
their test scores were correlated with their factor scores. One test, 
reading vocabulary, was found to correlate significantly with both 
factor scores. 
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Oa, sista was concerned with the relationship between teach- 

ers’ ratings of adjustment and aspirational traits in adolescents 
and various other self-report, objective, and projective measures 
of the same characteristics. 

P Teachers’ ratings are frequently used for diagnostic, prognostic, 
guidance, and research purposes because of their ready accessibility 
and quantifiability. Moreover, when employed as averages of 
several raters and in relation to traits (a) which are overtly mani- 
fest and objectively observable in everyday behavior and (b) 
which can be specifically and unambiguously defined (8, 20, 21), 
such ratings enjoy considerable reliability. A further advantage of 
teachers’ ratings is the fact that they can be based on frequent 
and prolonged observation of, behavior under a variety of condi- 
tions, : 

With respect to teachers’ ratings of adjustment, the main 

_* problem wag,to ascertain the kinds of relationships that prevail 
between these observational appraisals of personality traits im- 
volved in adaptation to the school environment and such other ; 
criteria of adjustment as absence of overt behavioral symptoms- 
tology or relatively low degree of deviancy with respect to the 
more general personality trends measured by projective instru- 
ments. To what extent do these different instruments measure 
similar, overlapping, or even entirely discrete aspects of the 
global concept of adjustment? A secondary problem had to do 
with the bases on which teachers’ ratings of adjustment are made. 
To what extent are they related to ratings of aspirational traits an 
to pupils’ scholastic competence and sociometric status? 

1 This research was supported by grants from the Bureau of Educations! 
Research and from the University Research Board of the University 
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econd major problem dealt with in this study was the 
nship between teachers’ ratings of aspirational traits and 
ife’ measures of level of aspiration. In a sense this question 
as the problem of the validity of such ratings. 


METHOD e 


ulation —The subjects in this study consisted of fifty stil? 
i comprising, with the exception of a few absentees, the 
re junior class of University High School? in Urbana, Tilinois. 
n age of these students was 15.8 years, and the dj 
sex was twenty-four boys and twenty-six girls. A maj®rit; 
ents in this school come from professional homes, the 
generally holding academic appointments at the University 
s, Admission to the school, however, is unrestricted ex- 
r the payment of a small nominal tuition fee. 
"eachers’ ratings,—Five current teachers of each student were 
to make ratings at the end of the school year. In all, the 
igs of fifteen teachers (eight male and seven female) were 
d. Students were rated on a five-point scale on general 
nt (defined in terms of personality integration and 
stability) and on three aspirational traits (persistence, 
tic competitiveness and academic aspiration). To minimize 
effect’, the teachers were requésted to rate all students on & 
trait before proceeding to consideration of the next trait. 
ratings of five teachers were averaged to provide a mean , 
ig for each student on four traits. ka 
wal life’ measures of aspiration.—‘'Real life’ indices of aspira- 
il level included both academic and vocational measures, The 
et consisted of (a) academic goal discrepancy—the algebraic 
between the student’s expressed academic aspiration for 
ent school year and his composite grade point average for 
preceding four semesters; and (b) academic performance 
e—the algebraic difference between the student's estimate 
cumulative high-school standing and his actual four-semester 
t average. Prediction of academic standing was made 
if six-point scale (well below average to upper few), and aca- 
‘authors ac ö ion and administrative assistance 
Oae SE i anaes of the staff of University High 


in collecting these data. 
To- 
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demic aspiration was expressed in terms of a five-point scale 
(get by to upper few). Grade point average was computed from 
actual grades which in this school are officially recorded on a 
five-point scale but are not divulged to students. 

Three measures of level of vocational aspiration were utilized: 


O vocational prestige needs—the student’s total weighted score 


the level of interests, part of the Lee-Thorpe Occupational 
Interest Inventory? (14); (b) vocational tenacity—a composite 
standard score on a hypothetical level of aspiration test consisting 


hypothetical situations in which the subject is successively 
to suppose that he is preparing for a career in medicine, 


‘ engineering, and a skilled trade, respectively, and that in the 


P 


course of his vocational preparation he meets serious obstacles of 
a specified nature. In each of situations, four alternatives are 
presented ranging from maintenance of the original goal at all 
costs (high vocational tenacity) to complete abandonment of the 
original vocational goal (low vocational tenacity). The subject's 
choice in each situation, appropriately weighted, is converted into 
a standard score, and all three standard scores are added to yield 
a composite score. (c) vocational unreality—the total discrepancy 
between each of the subject’s nine percentile scores on the Kuder 
Preference Record (14) and the corresponding mean percentile 
scores of persons in his occupation of choice as given in the manual 
for this test. 

Measures of adjustment—In addition to teachers’ rating of 
adjustment, the following indices of adjustment were employed: 
(a) M.M.P.I. adjustment score—total adjustment score on the 
Minnesota Multiphasic Personality Inventory (11). In deriving 
total adjustment scores, raw scores on each of the subscales were 
first converted into standard scores. Since a positive standard 
score on a given subscale indicated a degree of deviancy with 
respect to the trait measured that was greater than the mean 
score characterizing our population, a measure of total personality 
deviancy could be computed for each subject by merely summing 


3 This is a measure of general level of job prestige desired by an in i 
vidual. The subject is presented with thirty triads of occupational activiti 
in each of which area of work involved is held relatively constant while j 
of job prestige is varied. Total score is computed from the subject’s prefer 
ences in each triad appropriately weighted in terms of job prestige- 


D 
(ee eoo 
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e standard scores on the various subscales.‘ Finally, 
the sign of the total standard score, a measure of 
‘was obtained in which a high score was indicative of 
r good adjustment. (b) Rorschach adjustment score—an 
ent score derived from the Rorschach group test admin- 
cordance with the procedure developed by «Harrower 
r (10), and scored in accordance with the system d 
Klopfer and Kelley (13). The adjustment score repre- 
total number of entries made in Munroe’s check list* 
essed as a standard score with the sign reversed 
score is indicative of good adjustment. (c) Rors 
score—the total number of occasions specified signs of 
y appeared in the subject’s protocol. Criteria of anxiety 
fifteen commonly accepted signs such as diffuse shading 
signs*of shading shock, high number of card rejections, 
or evasive responses, high percentage of oligophrenic 
s, etc. (9, 13, 19). Each time one of these fifteen anxiety 
the subject was credited with one point. Thus, 
hach anxiety scores are indicative of relatively high 
vels of anxiety and vice versa. (d) Composite adjustment score 
h algebraic sum of the subject’s standard M.M.P.I., Rorschach 
nt and teachers’ adjustment rating scores. (e) LPI. 
score—total score on a paper-and-pencil inventory de- 
to measure anxiety level (Illinois Personality Inventory). 
ciometric status—the mean sociometrie rating earned by 
mt when rated sociometrically by all of his classmates 
point scale (‘definitely not wanted as a friend’ to ‘wanted 
st friend’). (g) Grade point average—the student's cumula- 
e point average over four semesters computed on a five- 
scale, As in the case of sociometrie status, a numerically 
re is at the desirable end of the scale. 
negative standard score is indicative of deviancy on the mas- 


femininity subscale, negative standard scores on this scale were 
other standard scores to obtain total M.M.P.I. adjustment 


scoring determinant. Entries in the check list, therefore, repre- 
cy from the proposed limits of normalcy. 


398 The Journal of Educational Psychology 


TABLE 1.—INTERCORRELATIONS AMONG PERSONALITY Traits as Ratep 
BY TEACHERS 


Scholastic i 
Adjustment | Persistence | Competi- | Academic 


tiveness | Aspiration 
Adjustmeat -90* 71 60 .42 
=Dersistence = .87* 81 73 
Scholastic Competitiveness _ — .83* .79 
Academic Aspiration — — — .90* 


: All coefficients of correlation in this table are significant beyond 
e pne per cent level of confidence. 
* Mean intercorrelation between five teachers’ ratings of subjects on 


s 2 


same trait computed by ‘squared r’ method, and corrected by Spearman- 
Brown formula to indicate the predicted reliability of the combined judg- 
ments of five raters. 


RESULTS AND INTERPRETATION 


Reliability and Generality of Measures.—The reliability (gen- 
erality over-raters) of teachers’ ratings is shown in Table 1. For 
each trait the mean intercorrelation (computed by the squared r 
method) between the ratings of five teachers on the same pupils 
was obtained. This was corrected by the Spearman-Brown formula 
to indicate the predicted reliability of the average rating of five 
teachers. Application of the correction formula was considered 
justifiable since an average of five ratings is more reliable than & 
single rating in the same sense and to the same predictable degree 
that a ten-item test is more reliable than a two-item test. The 
resulting reliability coefficients approximate the range of magn 
tude generally obtained with combined teachers’ ratings. 

Table 1 also shows that teachers’ ratings exhibit considerable 
generality over traits. This is not necessarily indicative of ‘halo 
effect’ since the three aspirational traits both closely resemble 
each other and are not logically unrelated to adjustment. Furl 
more, the intercorrelations among aspirational traits are higher 
than the correlations between these latter traits and adjustment. 

The reliability® of academic goal discrepancy and of academi? 


6 A more definitive presentation and discussion of the reliability, ar 
ality, and psychological significance of ‘real life’ measures of level of a 
tion are given in a previous paper (6). The more relevant findings 21° n 
marized here to provide some objective basis for appraisal of these measu! 


E RES? 
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ce estimate scores could not be ascertained, but there 

h degree of relationship between these two different 

level of academic aspiration. Vocational tenacity scores 

different kinds of situations (i.e., medicine, engineer- 

‘skilled trade) exhibited a degree of generality which was 

ate to the degree of similarity between the Vocations. ; 

retest reliability of the level of interests scores on th 

Interest Inventory (identical with our measure of 

al prestige needs) is given as .88 in the test manual (16). 

ability of vocational unreality scores could not be 

Vocational prestige needs were significantly kide E 

ational unreality and vocational tenacity in boys, but 

ns between academic and vocational measures of level 

ation were not significant. 

ot feasible for either technical or administrative reasons 

n reliability coefficients for the Rorschach measures, the 

adjustment score, or the composite adjustment score. 

relations among Rorschach adjustment score, M.M.P.I. 

ent score and teachers’ rating of adjustment (see Table 2) 

nerally non-significant except for the correlation between 

er two measures in our female subjects. The correlations 

each of these three measures and composite adjustment 
spuriously high because each measure was & component 

‘the composite score. M.M.P.I. adjustment score was the 

resentative component of the composite measure in 

nd teachers’ rating of adjustment enjoyed comparable 

mong girls. 


‘ABLE 2.—INTERCORRELATIONS AMONG ADJUSTMENT SCORES 


i Socio- 
Rorschach M.M.P.I. Teachers Bet! Bere Composite 
‘Anxiety Adjust- | Adjust- a ove ) Le Adjustment 


ment ment 


Boys |Girls|Boys|Girls|Boys|Girls| Boys | Girls |Boys/Girls|Boys Girls 


Asi justment |—.24|—.19) 
justmen: 
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Thus, it appears that the aspects of adjustment measured, | 
respectively, by teachers’ observational ratings of classroom 
behavior, a self-report index of behavioral deviancy with respect 
to various syndromes in psychopathology, and a projective meas- 
ure of general personality integration are relatively unrelated to 

each otHer in our sample of adolescents. If all three measures are 
ually valid, they must all be measuring quite different aspects | 
of adjustment which (except for M.M.P.I. and teachers’ ratings 
in girls) show no significant overlap whatsoever. The same type 
tionship holds true for self-report (I.P.I.) and Rorschach | 
mexsures of anxiety which were not significantly correlated. This 
latter datum is not in accord with the previously reported finding 
of a significantly positive relationship between these two measures 

of anxiety in a population of college students (4). 

More in accordance with expectations, I.P.I. anxiety scores 
correlated negatively with composite and M.M.P.I. adjustment 
scores, and with teachers’ ratings of adjustment in girls, and with 
Rorschach adjustment scores in boys. The relationships between 
Rorschach anxiety scores and other measures of adjustment were 
not significant. At least among our adolescent girls, therefore 
high levels of anxiety as measured by the I.P.I. tended to be © 
associated with poor adjustment and vice versa. A possible ex- 
planation for this sex difference might lie in the prevailing cultural 
attitude that high levels of striving (and of the anxiety that 
accompanies such striving) are more acceptable and compatible 
with good adjustment in boys than in girls. In support of this 
interpretation was the fact that teachers’ adjustment ratings wer 
more consistently negatively related to boys’ than to girls’ aca- 
demic goal discrepancy scores and academic performance esti- 
mates (see Table 4). 

Relationship between teachers’ ratings and other measures of 
adjustment.—As already noted, teachers’ rating of adjustment 
was not significantly related to Rorschach adjustment score m 
either sex, but was positively related to M.M.P.I. adjustment 
score in girls (p. < .01). Teachers’ rating of adjustment was also 
significantly more highly related to composite adjustment score 
in girls than in boys, although in each case the coefficient of cor- 
relation was spuriously high. Rorschach anxiety score was as 

D significantly correlated with teachers’ rating of adjustment, H 


ta a 
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ety score was negatively related to the latter score in 

< 01). 

d appear, therefore, that for our population of adoles- 
teachers’ ratings of adjustment are a more valid and psy- 
cally meaningful measure in relation to girls than in rela- 
joys. Appraisal of boys’ adjustment status is vey Poss a 


ated to extremely high levels of striving (and the asso- 
mxiety) are regarded as unfavorable for adjustment by 
| hygienists, but are nevertheless accepted as desi i 

m with our cultural ideal of masculine success. Ia t 
girls, on the other hand, teachers are apparently less con- 
‘by contradictory criteria of adjustment. Supporting this 
ation are (a) the significantly negative correlation be- ‘ 
girls’ I.P.I. anxiety scores and teachers’ adjustment ratings, 
trast to the approximately zero coefficient of correlation 
| the corresponding scores of boys, and (b) the more con- 
‘tendency for teachers’ ratings of adjustment to be corre- 
egatively with objective indices of academic level of aspira- 
the case of girls (see Table 4). 
clue regarding the bases on which teachers’ ratings of adjust- 
t are made appears in the moderately high degree of positive 


who receives a high teachers’ rating on adjustment is an 
ridual who is perceived by teachers as persistent and scholas- 
competitive, who is highly accepted by his classmates and 
)has a superior scholastic record. Table 4 also shows that his 
ic aspirations tend to be more commensurate with his past 
nic performance as evidenced by the negative correlation 
Ween academic goal discrepancy and teachers’ adjustment 
ng. In addition, if the pupil is a girl, she will tend to have high 
P.I. and composite adjustment scores and a low I.P.I. 
‘score. y 

e 3 also shows the relationships between teachers’ ratings 
Spirational traits and other non-rating scale indices of ad- 
ent. Aspirational ratings were not significantly related to 
PI. and Rorschach adjustment scores or to Rorschach @ 


T 
À ¢ af 
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Taste 3,—CORRELATIONS BETWEEN Txacuens’ Ratinos ann Vaniovs 
MEASURES or ADJUSTMENT 


Teachers Ratings 


Scholastic 
rec vue Gameet | Aimi 
“bh — 
* Girls Boys | Girls | Boys | Girs 
M.M.P.I, Adjustment 4 — .02|—.01|—.05]—.16)—.24 
Adjustment |— jl ki —.14|—.28| .22/—.12/ %0 
Anxiety : ‘02 -:14- 10/—.12/—.17 
Anxiety Level (I-P.I.) | .03)**—.52)—.30|**—.52|—.38)—.20|—.30)—.94 
Composite Adjustment | *. **.57| _.23)**.53)—.04)**.70 
Sociometric Status * 48) *.46) *,42) * 39 
me Grade Point Average “oan 
: t at the five per cent level. 
7 icant at the one per cent level. 


anxiety ratings. A puzzling negative relationship (p. < .01) pre 

vailed between girls’ I.P.I, anxiety scores and teachers’ ratings on 

persistence, Teachers’ ratings of aspirational traits were signifi- 

cantly correlated with composite adjustment in girls; in the case 
. of boys this was only true with respect to persistence. 

In general, as one might expect, grade point average was more 
highly correlated with aspirational ratings than with ratings of 
adjustment, but the reverse held true for sociometric status (se 
Table 3), 

Relationship between teachers’ ratings and ‘real life’ measures of 
aspiration.—Table 4 shows that there is a pronounced tendency 
for academic goal discrepancy scores to be negatively correlated 
with such highly related teachers’ ratings as scholastic compel 
tiveness, persistence, and academic aspiration. ‘Thus, students 
who expressed relatively high academic aspirations for the future 
in the light of previous scholastic performance tended to receive 
ratings from teachers which were indicative of relatively 10% 
aspirational level. This same tendency prevailed with respect to 
academic performance estimate, but was significant only for 
girls, and in only two out of three instances. Generally paka 
relationships between teachers’ ratings of aspirational traits 


Oe 
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Adjustment Persistence cares hag Fon eyed 


Boys Giris |‘ {Boys 


Girls | Boys | Girls 
ems ee 


int at the five per cent level. 
at the one per cent level. 


0 vocational aspiration were not statistically significant 
hibited no consistent trend with respect to direction. 

mative relationship between ‘real life’ measures of 
nig aspiration and teachers’ ratings of aspirational traits 
bly based on motivational behavior in an academic setting) 
bt upon the validity of the latter measures, It confirms 
ma previous study with intellectually gifted sixth-graders 
h the writer failed to obtain significant relationships 
ers’ ratings of aspirational traits and a measure of 
vation (i.e. relative tendency to respond to an 
‘personal prestige by increasing work output over the 
under conditions of anonymity). Another supportive 
s latter study was a zero correlation between these 
and pupils’ self-ratings on aspirational traits (1). 
on the other hand, obtained very significant differ- 
‘educationally retarded’ and ‘educationally accelera- 
children aged nine to fourteen with respect to teachers’ 
everance and ambitiousness. 

| educational performance (holding other factors 
i perhaps a more definitive criterion of the validity of 
pirational traits than either self-ratings or te 
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test measures, it should be realized that such evidence of validity 

is largely circular when the ratings are made by teachers, Since 

teachers’ ratings of motivation are substantially based on academic 

performance, the latter criterion can hardly be considered inde- 

pendent. In the present study, correlations of .61 to .92 were 

ay OP tained between teachers’ ratings of aspirational traits and 

grade point average (see Table 3). 

Taking all of the evidence into consideration, there is certainly 

good reason for questioning the validity of teachers’ ratings of 

’ aspirational characteristics. This lack of validity is apt to 

be®ven more serious in the case of adolescent pupils, judging from 

the findings of the present study and from the relatively poor 

insight of teachers into high-school pupils’ interests (6) and socio- 

P metric status (7). Also, at least with respect to the latter function, 

teachers’ perceptual ability has been shown to decrease markedly 
with increasing age of pupils (3, 17). 


SUMMARY AND CONCLUSIONS 


Teachers’ ratings of adjustment and aspirational traits were 
obtained for a class of fifty juniors in a University high school and 
related to various other self-report, objective, and projective 

. measures of the same characteristics. The following conclusions 
can be reached with respect to our particular sample of adolescent 
boys and girls, and deserve to be tested on a more representative 
population: i 

1) Averages of five teachers’ ratings of adjustment and aspit 
tional traits enjoyed high split-half reliability. J 

2) Rorschach adjustment score, teachers’ adjustment rating, 
and M.M.P.I. adjustment score were not significantly interco" 
related except for the latter two-measures in the case of ak 
Thus, all three instruments appear to be measuring quite differen 
aspects of adjustment showing little or no overlap. 


3) High level of anxiety, as measured by the Illinois Personality 


Inventory (I.P.I.) was consistently associated with poor an 
ment in girls. This was not true of boys. The miee high 
it! 


offered that anxiety, especially the variety associated W. 
striving, is more compatible with and culturally accep 
a evidence of good adjustment in boys than in girls. ie 
J ratings of adjustment appear to be more meal 


Ue ee 
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d psychologically valid for girls than for boys in as much as 
correlated in the appropriate direction with M.M.P.I. 
y te adjustment scores and with I.P.I, anxiety score. 
ex difference is attributed to the prevailing cultural ambiv- 
‘regarding the adjustive value of anxiety for boys in the 
f their social sex rôle. In the case of girls, teachets’ ratin; 

istment also tended to be more consistently correlated 

tive direction with ‘real life’ measures of level of academic 


e highly accepted by their classmates, who had superior 
stic records, and whose academic aspirations were com- 
rate with past academic achievement. 

Peachers’ ratings of pupils’ aspirational traits were very 
related to the latter’s scholastic standing. Hence, academic 
nee can not be used as an independent criterion of the 
idity of these ratings. 

j In view of the reliably negative correlations obtained be- 
en ‘real life’ measures of level of academic aspiration and 
? ratings of aspirational traits, the validity of the latter 
üre seems highly questionable. 
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chester public schools have recently experimen’ $ 

ways for improving reading ability. The einion. M 
for normal classroom use and were carried on with ninth- 
Attention was centered particularly on methods which 
ective with poorer readers. This effort began in 1951- 
th xperimental use of the fourteen State University of 
ing Films, and was extended in 1952-1953 to include a 
er of other group methods. 


aM 1951-1952 EXPERIMENT 
pose.—During the school year 1951-1952 the State Univer- 
‘Towa Reading Films were used experimentally in all ninth- 
of two high schools. The purpose of this experiment 
investigate the merit of the films in the improvement of 
ability. 
ting up the experiment, certain conditions had to be met: 
actual showing of the films had to be fitted into the regular 
lle of classes; (2) the projection rooms could be used only in 
\ way as not to disrupt their use by other classes in the school; 
ice reading material had to be readily available; (4) the 
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they could be carried on by classroom English teachers who 
not specially trained in the teaching of reading. 


t i Pre-test 
‘orm A of the Nelson Silent Reading Test! was administered 


n H Il results were based on total scores for the Nelson Silent Reading 6 
were composed of scores on the following separately, d sub- 


D 
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to all ninth-grade pupils (four hundred and seventy-three) in two 
high schools in October 1951. The two schools selected were of 
average size, had heterogeneous populations, and were not atypical 
of other high schools in the city. This and all subsequent tests 
were centrally scored by experienced test clerks. 
2) Th@fourteen State University of Iowa Reading Films were 
Bed with one half of the groups at the rate of one film a week, and 
with the other half at the rate of two films a week. Each film show- 
ing occupied three to five minutes at the beginning of the class 


3} After each film a ten-question objective-type test (supplied 
by the film producer) was administered in order to sharpen the 
attention of the pupils to the content of the film. Each pupil kept 
a record of his scores on these ‘film tests’. 

4) During the same class period the pupils read a selected 
Reader’s Digest article and noted the length of time required for 
reading. 

5) Each pupil converted his reading time into a reading rate for 
each article and maintained a continuing graph of his reading rates 
for the fourteen articles. 


Mid-test 

6) A few days after using the last film in the series, Form B of 
the Nelson Silent Reading test was administered. This was M- 
tended to measure the reading ability at the end of the film 
showings. 

7) Then followed a ‘retention’ period of three and one-half 
months during which there was no particular classroom emphasis 
on reading improvement. 


Post-test 


8) After the expiration of this interval, Form C of the Nelson 
Silent Reading test was administered. This was intended to meas- 
ure the reading ability three and one-half months after the showing 
of the films had been completed. din 

9) No control groups were established. Gains were measure! 


zE ail 
tests: vocabulary, reading for general significance, ability to note ae 
and ability to predict probable outcomes. 


$) ad 
ra 


An Experiment in Reading Improvement 409 


Median Gains 
oD iment} At Nee S Total Beyond expectation 
e 
10.39 15 months 8.5 months g” 


classroom situations, and were based upon the grade equiv- 


s established by the test publisher. 
indings.—The findings for the ninth-graders as a gro nTa, 
wn in Table I. 
‘There was no significant difference between the groups that 
one film a week and the groups that saw two films a week. 
During the course of the experiment (six and one-half months) 
gain for the four hundred and seventy-three pupils was 
n months, or a net gain of eight and one-half months beyond 
ation (expectation meaning the number of months in the 
al of the experiment). ; 
3) After separating the pupils on a basis of their scores at the 
ning of the experiment, (a) the median gain of the top quarter 
five months beyond expectation, (b) the median gain of the 
half was eight and one-half months beyond expectation, 
e median gain of the lowest quarter was fifteen months 


4) The distribution of the gains of the four hundred and seventy- 

‘pupils showed that four hundred and forty-five pupils, or 

a per cent, gained more than expectation during the 
the experiment. 

Questions Raised.—Consideration of the findings of the experi- 

; rought up two significant questions: 

1) Would the rather startling gains made during 1951-1952 be 
tained, or increase still more, or decrease, during the subse- 
Quent year, if the pupils had no further experimentation with read- 
‘ng? This question might be answered by follow-up testing during 

llowing school year. 

Would it be possible to attain similar gains by using a simpler 
od made up of parts of the original method? This seemed 
gu arly important because the original method involved cer- € 
n difficulties, such as, (a) the problem of scheduling Qe 
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TABLE I] —Compartson OF GAINS ror NINTH AND TENTH GRADE Tusting 
OF IDENTICAL PUPILS 


Grade Changes in Median Grade Equivalents Interval Comparative Gains 


9th 14.5 months gain during 6.5 | Oct. ’51- | 8 months more 


months (gr. level 8.95 to gr. level] May ’52 than expecta- 

10.4) tion 
10th | 4 months gain during 9 months | May ’52- | 5 months less 
(gr. level 10.4 to gr. level 10.8) Apr. 753 than expecta- 

tion 


a i mi . 
A classes with limited numbers of films and projection rooms, 


P 


(b) eyestrain reported by some pupils, (c) frustration and confusion 
of some pupils, resulting in decreased motivation, (d) the arti- 
ficiality of reading from a motion picture screen. 

Follow-wp.—This was an effort to determine the extent to which 
the gains made during 1951-1952 were carried on into the tenth 
grade when there was no particular emphasis on reading (Question 
1 above). 

The Nelson Silent Reading Test, Form C, was given in April to 
the tenth grade in the same two high schools. Table II includes 
three hundred and fifty-six identical pupils. The first line shows 
gains made during the ninth-grade experiment, the second line 
shows gains made during the tenth-grade when no experiment was 
being conducted. 


1952-1953 EXPERIMENT 


Purpose-—An attempt was made to answer Question 2; namely, 
would it be possible to attain similar gains in reading ability bY 
using a simplified method? ; 

Procedures.—1) All ninth-grade pupils in the Rochester public 
schools (2166) were pre-tested with the Nelson Silent Real 
Test, Form A, in October, 1952. The tests were scored centrally by 
experienced test clerks. sre 

2) The 2166 pupils were divided into the eight groups listed 
low, with a different method being used in each group. Each — 
contained some above-average, some average, and some vee 
average English classes. These eight methods for improving 


pils and Schools 


3 upils in four schools 


F 229 pupils in two schools 


ir 


a 


orm C, in March, 1953. 


Table III. 


periment in Reading Improvement 


2166 pupils in nine schools 
was followed by a ‘retention’ period of three and one-half 


41 


then used for the next seven weeks: 


Description of the Method 


14 timed articles from the Reader’s 
Digest, graph of reading rate 

14 reading films and film tests 

7 reading films and film tests, 14 
timed articles from tffe Reader gg 
Digest, graph of reading rate ` 

3 reading films and film tests, 7 
timed articles from the Reader's 


Digest 

14 reading films and film tests,’ 
timed articles from the Rellder’s 
Digest, graph of reading rate 
(same method as in first year) 

14 timed articles from SRA Better r 
Reading Book II, graph of read- 
ing rate 

14 timed articles from Let’s Read 
(Silver book), graph of reading 
rate 


729 pupils in seven schools No special treatment, pre- and 


post-testing only 


© control groups were established. Gains were measured in 
classroom situations, and performances were expressed in 
he grade equivalents established by the test publisher. 

imgs—1) Median gains for each of the eight methods are 


ering the lowest quarter, the middle half, and the top 

the pupils, separated on the basis of their scores on the 

t at the beginning of the experiment, the median net gains 

tach method (gains beyond expectation) are shown in Table IV. 

bis worth noting in Table IV that these methods produced 

net gains with all groups of pupils and the largest net 

atl the lowest quarter of pupils. 

An examination of the data showed that there was apparently 6 


ee 
C «, 
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Taste III.—Resuurs or PRE- AND Post-Tustine Ninth GRADES, 


1952-1953 
Median Grade Equivalents Median Gains 
Meo At start of At end of 
e prn t e: HEU Total Beyond expectation 
MESMER | 
A 9.2 + 10.3 11 months 6 months 
B 8.9 10.0 11 months 6 months 
Cc 9.0 10.2 12 months 7 months 
9.0 10.2 12 months 7 months 
8.4 9.7 13 months 8 months 
F 9.3 10.5 12 months 7 months 
G 8.6 10.0 14 months 9 months 
H 8.7 9.9 12 months 7 months 


no relation between the size of the net gains of the lowest quarter 
(Table IV) and the median of the lowest quarter at the start of the 
experiment in October, 1952. For example, the two groups with 
the lowest starting grade equivalents were E and H. These made 
neither the greatest nor the least net gains. Similarly, the two 
groups with the highest starting grade equivalents were D and F. 
They made neither the greatest nor the least net gains. The median 
starting grade equivalents ranged from 6.5 to 7.1. 

5) With respect to the lowest quarter of the pupils, the difference 
from one method to the next is not large. It is worth noting, how- 
ever, that the two groups that used the entire series of fourteen 
films (B and E, Table IV) made the smallest gains. The two groups 


Tase IV.—Mepian Ner Gains sy Each METHOD FOR VARIOUS Levels 


or Reaping Apruity Ninta Grapes 1952-1953 


Lowest quarter Middle half | Top querer in 
Method pele i A eq. in 18 tod ea in 9.8 or hgn re 
A 14 months 6 months 2 months 
B 7 months 7 months 3 months 
c 12 months 7 months 4 months 
D 13 months 7 months 4 months 
E 9 months 8 months 7 months 
F 11 months 8 months 6 months 
G 18 months 7 months 7 months 
H 11 months 7 months 4 months 
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e the largest gains were the ones that only read some type 
articles (G and A, Table IV). 

e distribution of the gains of the 2166 pupils shows that 
eighty-five per cent, gained at least as much as expectation 


the course of the experiment. 
a 


methods used, both in 1951-1952 and 1952-1953, were 
in raising reading levels of ninth-graders as a group beyond 
the eight methods used in 1952-1953, there was no appre- 
difference among the various methods in median net gains 
total groups. 


quarter of the pupils. 

the 1951-1952 experiment startling gains in reading ability 

ociated with the State University of Iowa Reading films; 
in 1952-1953 similar gains were obtained without the use 

ing films. 

're-testing and post-testing alone appear to be a valuable 

of motivation for reading improvement. 

lains more than expectation which occurred with experimen- 

changed to gains less than expectation when experimental 


IMPLICATIONS FOR CLASSROOM PROCEDURES 


experiments seem to imply that the critical factors in in- 
reading ability of pupils are (1) the focusing of attention 
on reading improvement of the pupils, and (2) the 
g of attention of each pupil on the possibility of his own 
ovement in reading. This may be accomplished through a 
tic, planned procedure. It would appear that systematic 
y in the use of films and/or practice reading material is a 
ig motivating device. It is also apparent that pre-testing at the 
g of the school year, followed by post-testing several 
s later is a strong motivating force, providing that the test- 
lf is announced and explained in advance. ; 

he once-emphasized attention on reading improvement is re- 
pupils generally will then fail to continue to improve at the 
d rate. Review of the 1951-1952 and 1952-195%qyeng- 


i 


CONCLUSIONS P 


in methods were appreciably more effective with the ` 
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ments indicates that these methods of improving reading result in 
a temporary acceleration of gains. When the emphasis is removed 
the gains tend to decelerate to normal or less, although the actual 
reading level (grade equivalent) may still be above the chronologi- 

cal grade level. 
All the®eight methods described in a preceding section were 
“Bective in improving reading ability of pupils, and particularly so 
for the poorer readers. Each method is relatively easy to use, and 
may be incorporated into most high-school classes with a minimum 
erence with other classroom work. It would appear that 
oom teachers might well consider any one of these methods 

: as a means for reading improvement. 


A 


CHOLOGY PREREQUISITE TO 
YUCATIONAL PSYCHOLOGY 


ROY GLADSTONE 


Oklahoma Agricultural and Mechanical College ® 


at which the investigations reported below were 
whether or not the competence of students in educa- 


) slightly different designs. 
DESIGN OF THE FIRST EXPERIMENT 


The hypothesis tested was that students who had 
n some work in psychology would do no better in 
sychology than those who had taken none. 

The criterion of success used was ability exhibited on 
essay and objective tests submitted by the student. 
‘designed to test the ability of the student to apply 
insights rather than to recognize or reproduce memo- 
A numerical error score was assigned to each piece 
in by the student. The error score on the objective 
ie number wrong. The error score on the essay materials 
by estimating how far short the student fell of the 
Stated in a model answer. The objective test errors con- 
out 1.5 times as much to the total score as did the essay 


e subjects used were drawn from the Fall, 1950, edu- 
yehology classes of the writer. Two groups of subjects 
d. One of the groups was composed of subjects who had 
d were not concurrently taking any courses that were 
chological in nature. The other group was composed 
who had completed one or more courses which were 
chological in nature (whether taken in the depart- 
ychology or not) before entering the course in educa- 
hology. 

The next step was to draw matched subjects from the 
Matching took place in ignorance of the ie rion 


415 @ 


ogy at the Oklahoma Agricultural and Mecheai 
affected by the fact that they had previously taken, 0 
ntly taking a course in psychology. This purpose was 
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scores. The subjects were matched on three variables—ACE per- 
centile, hours of ‘outside’ work, and sex. Twenty-eight pairs were 
matched. The mean difference between the two groups was 0.5 
ACE percentile units. The mean absolute difference was 1.5 witha 
range of 0-4. The differences favored the group having had no 
psychology—a conservative error. 
Twenty of the twenty-eight pairs were matched by sex. l 
In order to match the students on hours of work aside from 
school work, data were gathered by means of a check list consisting 
categories: 0-5 hours of work, 5-10, 10-15, 15-20, and more 
A En? 20. The students were instructed to consider as work anything _ 
which occupied their time which fell outside the “normal student — 
living processes.” They were instructed to consider housekeeping 
] and varsity practice, but not studying, as work for this purpose. 
The matching on this variable was not as good as for the ACE 
(ACE scores were considered more important and given priority in 
the matching process), exhibiting a mean difference of 0.5 scale 
units in favor of those who had taken psychology, i.e., they worked 
about two and one-half hours less per week than those who had 
taken no psychology. This was a non-conservative error. 
Tn all, it is considered that the matching was reasonably good. 
Results: Of the 28 who had taken psychology, 20 did better than 
the subjects with whom they had been paired, 2 did just as well, 
and 6 did more poorly. Using a split of 20 to 8, this yields a chi- 
square of 5.14. With one degree of freedom this corresponds to & 
p-level of less than .05. 
The mean difference between the two groups was 15.5 error w 
in favor of the group which had taken psychology. This correspon 
to about one median deviation of the array of error scores. 


nits 


EXPERIMENTAL DESIGN OF THE SECOND EXPERIMENT 


Hypotheses: Three related hypotheses were tested. The first w 
that those students who had taken the first course only in a dene 
ment of psychology would do no better in a course in education’ 
psychology than those who had not taken psychology. The sec” 
hypothesis was that those who had taken the first course only ae 
department of psychology or one course only which was aes 
psychological in nature, but given by a department other t n 
psychology, would do no better than those who had no psy i 4 
apap eround. The third was that those who had some psycho 08 
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ical background, however extensive, would do no better than those 
who had no psychology. These three phases will be referred to 
| as units 1, 2, and 3. The three units are cumulative—that is, 
2 includes unit 1, and unit 3 includes both 1 and 2. 
_ Criterion: The criterion of success was score on the standardized 
"test, “A Study of Barry Black.” This test is designed to test ability 
to apply psychological knowledge rather than to test memory. [tgo” 
reliability is .79 (1). 
_ Subjects: The subjects were drawn from the Fall, 1951, and 
Fall, 1952, educational psychology classes of the writer e 
1952, educational psychology class of another me A 
Again two groups of subjects were defined. One was composed of 
_ subjects with no psychology course background prior to the course 
‘in education psychology. The other was composed of subjects who 
had taken some psychology course either previously to or con- 
curently with the educational psychology course. 
Matehing: This time the subjects were matched on ACE score, 
hours of work, sex, college class, instructor, and year the course was 
taken, The matchings took place in ignorance of the scores made 
on the test. In no case was the student of one instructor matched 
with a student of the other. Neither was a student taking a course 
in one year matched with one taking a course in another year. The 
‘Text most important variable was considered to be the ACE score 
and some accuracy in matching on the remaining variables was 
' Sactificed in order to get a good match on the ACE score. 

Table I indicates the effectiveness of the matching procedure. It 
Will be noted that the average differences in ACE scores are all in 
| favor of the group which had taken psychology, but the differences 

are $0 small that they were considered negligible. The differences in 
tours of work in favor of the ‘with psychology’ group are propor- 
tionately larger but still small. The differences in class standing 
Were in favor of the ‘no psychology’ group. Matching on the vari- 
“Able of sex was not good but the significance of this is obscure. The 
Mean absolute difference in ACE score was about half a standard 
“tor and in no case was a match made where the difference was 
Seater than 7 or approximately one standard error. In all it is 
Considered that the matchings were adequate. Matchings for unit 
Were best. 


‘Tam indebted to Dr. Roy Sommerfeld of Oklahoma A. and M. College ® 


, for his coöperation, S 
RA a 
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TABLE I.—MATCHING FOR THE THREE UNITS 


Unit 1 Unit 2 Units 
Number of pairs 28 36 39 
ACE mean algebraic difference*:>* —0.32 —0.08 -0.13 
ACE megp absolute differences? 2.9 2.9 2.9 | 
Ninn. a of ACE differences 7 7 7 
ork mean differences** —0.25 —0.06 —0.15 
Class mean difference*:* 0.25 0.25 0.21 
Number of pairs matched on sex 20 21 23 


“The standard deviation of the ACE combined raw score is about 25 (2). 
b The standard error of the ACE total raw score is of the order of 7 (2). 
o A minus indicates a difference which is presumed to be in favor of the 
group which had taken psychology—a non-conservative error. 
a 4 Five-hour classes make up this scale. Thus a difference of 0.25 indicates 
about one and one-quarter hours per week difference in time spent working. 
+The freshman class was assigned a value of one; the sophomore, two 


etc. 
Taste Il.—Finpines or THE SECOND EXPERIMENT? 
Unit 1 Unit 2 Unit 3 
OURO Cte TOE TITSE, PE -a 
Average difference between means*-> 10.93 | 22.03 | 18.86 
t-scores of difference 1.07 2.29 1.95 


^ The difference in raw score between the 40th and 60th percentiles is 
20; that between the 80th and 90th is 20 (1). 
b All three differences are in favor of the ‘with psychology’ groups 


Findings: The results of the second experiment are summarized 
in Table II. 

If the one-tail test is used, all three t-scores correspond to the 
five per cent level or less. If the two-tail test is used only the secoù 
unit reaches the five per cent level of significance, that for ¥ 
third unit falling slightly short of that level (t = 2.025 for 
degrees of freedom at the five per cent level). 


12I am indebted to the Statistical Laboratory of Oklahom' 
College for the calculation of the t-scores. iment was 
è The use of the one-tail test seems justified since the experimen bel 
designed to test only whether having had psychology makes & rye 
D ter able to cope with educational psychology. There is no reason to 
that a complete reversal of this proposition would occur. 
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SUMMARY 


experiments were designed to test the efficacy of a psycho- 
background as an aid to the student in educational psy- 
. Both experiments were paired experiments and differed 
only in that the criterion of success in the first was a 
on all the teacher-made assignments of the course i 
ional psychology while the criterion of success in the pit 
e score on a standardized test. 
e first experiment the difference in favor of the ‘with psy- 
group was fairly large and the five per cent le 
through the use of the fairly crude chi-square. The sévon 
nt was analyzed in three stages. All three showed a differ- 
fayor of the ‘with psychology’ group. Using a paired analy- 
the one-tail test all three stages were significant at the five 
cent level. Using the two-tail test only the second stage was 
ificant at the five per cent level. 


DISCUSSION 


interesting to speculate concerning the reasons why the 
in the second experiment was smaller than that in the 
One possible reason is the fact that, having analyzed the re- 
is of the 1950 experiment, the instructor told the two successive 
that those without the prerequisite psychology were at a 
vantage and gave the class some material designed to help the 
t bridge the gap. Another possible reason is that the criterion 
experiment was both longer and broader—there were 
Ibjective questions and some essay material was included. 


CONCLUSIONS 


‘concluded that a background of psychology will help the 

smake a higher grade in the course in educational psychol- 
y at Oklahoma A. and M. College. Whether or not this would be 
ue ‘other colleges could easily be a function of the nature of 
es which are given. The differences actually obtained were of 
t magnitude to affect the grade by one grade point, i.e., a 
without a background of psychology who received a C 

ly have received a B had he had a course in psychology. 
this is hardly world-shattering, it is certainly of sufficient 


ce to be taken into account. " 
4 
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Lastly, it should be explicitly stated that there is no implication 
in this material that a prerequisite course in psychology affects 
only the grade in educational psychology. It is reasonable to as- 
sume, for instance, that class discussions can go faster and further 
when the students have some psychological background. Investi- 
gations of such factors would be very desirable. 

= 
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AN EXPERIMENTAL STUDY OF THE 
READING-RATE CONTROLLER 


GEORGE F. WOOSTER! 


Y Ohio State University ¢ Ss 
For a number of years the use of various mechanical devices, 
‘such as the reading-rate controller, the metronoscope, the Harvard 
Reading Films, and the tachistoscope, for improving i 
been one of the most controversial subjects in the field of readin, 

Opinions on the topic range from completely favorable to wholly 
unfavorable. A large percentage of the studies published with 

_ tegard to this subject deal with the use of the Harvard Reading 
Films and the metronoscope. While commercial representations 
concerning reading-rate controllers have increased, there has been 
practically nothing in the professional literature about the effec- 
tiveness of their use. 

Most of the reports on the use of these devices (other than the 
teading-rate controller) have been quite favorable. They tend to 
indicate that fine results may be expected. Many of these studies 
have been well done; there is one aspect, however, which should 
be noted—in each case, experimental groups were compared only 
With control groups which had been given no training at all. 

A report from Lauer, Henry, and Fritz (5), and one from Henry 
tnd Lauer (4), deviate from this pattern.*These people, working 
with college students, compared four methods of improving read- 
ing, as well as comparing the experimental groups with a control 
group having no practice. (A mechanical device, the metronoscope, 
Was used with only one of their experimental groups.) All their 
&xperimental groups showed definite improvement, with no one 
group significantly better than the others. These investigators 
Concluded that any reasonable technique involving the reading 

ction would result in improvement, and artificial devices may 

More spectacular than useful. 

Tn View of the claims made concerning the advantages of the 
Teading-rate controller as a means for improving reading, more 


1 This stud: i i in the Department of 
ly was the basis for a M.A. thesis done in the Dep’ 
a logy at Ohio State University under the direction of Dr. John R. € 
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data are needed. The study here considered, although carried out 

on a small scale, was set up in an attempt to gain some data of 

the type needed with regard to these more recent mechanical 
devices. We were directly concerned with the possible value of 

supplementary training on the reading-rate controller in conjunc- 
ion with". course in effective study. 

‘Reading is only one of several topics considered in this effective 
study course. (See Francis P. Robinson, Effective Study; this book 
is used as a basic workbook for the course.) The reading process ` 

ys of improving reading are discussed in class after the 
judénts have read the project in Effective Study. Generally, three 
or four class periods (or portions of several periods) are set aside 
during the quarter for practicing speeded reading. Further con- 
sideration is then given to reading as it may be a part of other 
topics covered, such as the SQ3R (Survey, Question, Read, Recite, 
Review) method of study. 

If students wish to work on improving their reading skills in 
addition to what is done in class, individual help may be obtained 
from the instructor. Some students may also obtain individual 
help from a graduate counselor. (Practicum training of counselors 
includes a period of counseling with students who are taking the 
Psychology 411 course.) In such cases, there is a reading-rate 
controller available for individual use if desired, as well as different 
types of practice reading material. 

While improvement of reading is not, in itself, the major purpose 
of this effective study Course, experience has shown that during & 
quarter (approximately ten weeks) a large percentage of the 
students can be expected to show some improvement. Changes 1” 
reading rate and comprehension are indicated by a comparison a 
test scores, particularly those obtained on two parallel Robin 
Hall Reading Tests which are given at the beginning and the en 
of the quarter. Knowing that some improvement could be e%- 
pected, the question to be considered was whether or not greater 
improvement would be achieved if students were given training 
on the reading-rate controller in addition to the training Tece 
in the course. 


} 


PROCEDURE 


D> All students taking Psychology 411 (Psychology of Eada’) 
Stugy gpd Individual Adjustment) during the spring quartet 


p ð 


six students each were selected, on recommendation by 
ctors of the several classes, from the large number who 


for the project. Lack of available facilities limited the 
e 


luling training periods for each of the students, however, 
difficulty encountered in trying to match the groups on 
the reading test score, made it evident that any 

only be approximate. Consequently, for statistical 
the groups were treated as unmatched groups. 

training sessions were scheduled during a four-week period. 
tal Group A received three hundred forty-five minutes 
ing on the reading-rate controller in eleven half-hour 
and one fifteen-minute session. Experimental Group B 
three hundred fifteen minutes of training on the controller 

-hour sessions and one fifteen-minute session. f 

levice used was the reading-rate controller manufactured 
: Three Dimension Company of Chicago, Illinois. (A desorip- 
the apparatus, the mechanics of operation, and the manner 
ng material can be found in the manual, Reading-Rate 
by Helen M. Robinson, Director of the University of 
Reading Clinic.) The reading material used consisted of 
sections taken from various issues of the Reader’s Digest and 


S of reading. This allowed for a change of the rate setting 
controller within a session as well as from session to session. 
for increasing the rate setting were determined on the basis 
subject’s report that he could keep up with the machine, or 
of it, without difficulty, and an oral comprehension check. 

was increased only if the comprehension level as checked 
d at seventy per cent or better.) Increments of increase in 
‘setting were kept small—ten to fifteen words per minute. 
ed for more frequent increases, which it was believed 
e as a further motivating factor, and aided in keeping a 
eck on comprehension. Subjects were not told at what q 
y were reading unless they inquired about it. Y= 


\ a, 


i 


2 


2 


2) 


Pa 
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At the end of the training period, the twelve subjects, as well 
as all other students taking the course, were tested on the “Read- 
ing Test in History: Russia,” a test form comparable to the pre- 
training test.2 The scores from this test could then be compared 
with the scores obtained on the “Reading Test in History: Can- 
ada,” the pre-training test, which provided a basis for comparisons 


Doth within the groups and between the groups. Scores on both 


these tests were available for a group of forty-eight students in 
these classes in addition to the twelve subjects in the two experi- 

groups. The former group was then considered as a control 
group to determine if those who had received training on the rate 
controller in addition to taking the effective-study course showed 
significantly greater changes in reading ability than those who 
took the course but had no supplementary training on the reading- 
rate controller. 


RESULTS 


Comparable pre-training and post-training scores on rate and 
comprehension were obtained by using two parallel reading tests 
from the Robinson-Hall series. With these scores, a comparison 
was made for the subjects within each group, and the means of 
the three groups were compared with each other. All differences 
were tested by the use of the t-test. 7 

From an examination of the data thus obtained, the following 
conclusions may be drawn: 

A) With regard to rate: 

1) The gain in rate was significant at the two per cent level for 
the two experimental groups, and at the one per cent level for the 
control group. 4 

2) Variability was greater on the posttest; the standard devis- 
tion was greater for the scores on the posttest in each group: á 

3) Decreases in rate were evident for only eight students, all 
whom were included in the control group. 


4) Comparing the mean gains in rate among the three groups, 


small differences were noted, None of these differences, however, 
was statistically significant. 
o Manual 


2 For information on the comparability of these two tests, see thi 
for the Robinson-Hall Reading Tests, Ohio State University. 
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a 
Le 1.—Tmts TABLE SHOWS THE GROUP MEANS ON THE PRETEST AND 
Posrrest FOR RATE AND COMPREHENSION 


Mean Test Scores 


Rate (words per minute) Comprehension (per cent right) 


Pretest Posttest Pretest i Posttest "i 


189.83 | 237.83 69.33 63.50 
184.83 | 220.33 81.33 73.67 
181.80 | 226.40 64.19 —— 


h. © 

With regard to comprehension: 

each experimental group, one-half or more showed some 

so in score; nearly one-half of the control group also showed 
fariability within the group was less for the posttest than 

he pretest; this was true in all three groups. 

io clear pattern of relationship between amount of gain or 

te and amount of gain or loss in comprehension was 


SUMMARY 


brief study has been concerned with the possible value of 
mentary training on the reading-rate controller in conjunc- 
M with a course in effective study. As measured by two parallel 
of the Robinson-Hall Reading Tests in History, most of the 
ents taking the course showed some improvement in rate of 
As a group, the mean difference in rate between the 
t and posttest was statistically significant (P = .001) for 
who did not have additional training. There was no statis- 
Significant difference in comprehension scores. 
ts receiving additional training on the reading-rate 
er also showed significant improvement (P = .02) in rate 
g. Again, there was no statistically significant difference 
ehension scores. Gains made by these experimental 
ps, however, were not significantly different from the gains of 
trol group. { 
case, no extra benefits seemed to accrue from additional 
on the reading-rate controller, as used. The study sheds 


t= 
is. 
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some light on the possible value of using such a device in me 
tion with a course where reading is treated only as one of several 
topics, Further investigation is needed, however, concerning the 
practical usefulness of these machines and the comparative effec- 
tiveness of different approaches to the problem of improving 
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TIONSHIPS BETWEEN EIGHT 
RIABLES AND F-TEST SCORES 
OF TEACHERS 
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H. H. REMMERS AND MARTIN STEINBERG ¢ 


Purdue University 


igators have concerned themselves with the pı 
ization of attitudes, Hartshorne and May (1), for 
concluded that it is unjustifiable to consider hon- 


e, and self-control as general characteristics of children, 
that these traits are a functionvof the situation. On the 
, Herrick (2) has drawn conclusions in favor of general 
college students concerning social issues and matters 
ct. Gough (3) has found that the more antisemitic sub- 
is samples are also characterized by lower intelligence, 
d economic background, conservatism, and less 
to name but a few of the relationships discovered. 

d others (5) came to similar conclusions in their exten- 


sent investigation is tangential to those mentioned 
major interest here is in the factors related to certain 
. Struening (4), with university faculty members as 
in a study similar to the present one, found a correlation 

een attitude toward the church and F-test scores. 
scores on the measure of attitude toward the church 
vorable attitudes, the correlation indicates that the 
ble individuals on this continuum tend to be more 
ian. In addition, Struening found that a favorable 
toward the church was positively related to church 
rship. Another interesting finding was that the most favora- 
p was Catholics, then Protestants, and at the least favora- 
of the continuum, Jews. This same order prevailed in 
es, with Catholics at the more authoritarian end of the 
An attempt is made in this study to determine the 
ips, if any, between scores of elementary- and secondary- 
chers on the Social Attitude Scale of Adorno et al (5) 


gi t> 
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and nine other variables. The ten variables are as follows: 

1) Scores on the Social Attitude Scale or F-test or SAS (6) 

2) Scores on the Sim’s Social Class Identification Scale, or SCI 
(6) 

3) Attitude toward the church, as measured by the Kelley- 
Remmers Scale of Attitude toward Any Social Institution, or SAT 


=) 


4) Sex (weighted 0 for male, 1 for female) 
5) Age 


W hurch attendance (weighted 0, 1, 2, and 3 for increasing 
frequency of attendance) 


2) 


7) Home discipline (weighted 0 and 1 for lenient and strict, 
respectively) 

8) Political party préference, or Pol. (weighted 0, 1, and 2, for 
Republican, Democrat, and Independent, respectively. An as- 
sumption is made here for correlational purposes that a continuum 
is represented by the order of code numbers.) 

9) Race (weighted 0 for white, 1 for Negro) 

10) Religion (Protestant, Catholic, or Jewish) 


COLLECTION AND ANALYSIS OF DATA 


The data were collected from one hundred eighty graduate 
students in education at the University of Southern California 1m 
July 1952. Practically all of the members of the group were teach- 
ers or school administrators. All intercorrelations among the 
variables were computed, and three multiple correlations with 
F-test score as the dependent variable were calculated. The three 
multiple correlations were for Protestants, Catholics, and for the 
total sample, including Protestants, Catholics, and Jews. A sepa- 
rate multiple correlation for the Jewish sample was ruled out due 
to the small sample size (N = 8). 


INTERPRETATIONS AND CONCLUSIONS 


1) It must be remembered that the correlations presented e 
probably underestimates of the population parameters, since a 
group was extremely homogeneous. As concerns the curvilinearity 
of the church attendance variable, Table 4 indicates that noni 
the etas were significantly greater than the original correlatio 
between the same two variables. 


A 4 
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2) Although the mean scores of Catholics do not differ signifi- 
cantly from those of Protestants on any variable, and despite the 
‘fact that the correlation matrices for these two groups are very 
ar, still the multiple correlation for the Catholic sample is 
‘Significantly greater than that for the Protestant group at far 
beyond the one per cent level. 

8) An eta of .248 between frequency of church SUAS ange 
SAS scores for the total sample is found in Table 1. In addition, 
‘the eta representing the relationship between frequency of church 
attendance and attitude toward the church is found to 
These etas are both statistically significant, at better aan 
one per cent confidence level. This investigation, then, has sub- 
'stantiated Struening’s findings, mentioned above, with regard to 
the correlation of church attendance with both attitude toward 
the church and authoritarianism scores. In Table 1 the correlation 
between F-test scores and attitude toward the church is also 
found to be significantly large. 


TABLE 1.—CORRELATION MATRIX ror ToraL SAMPLE 


SCI | SAI | Sex Age i Home| Pol. Race 

Soc, Att, Scale .024|.1614/.044 | .003 |.193*|.122 |—.174t] .168f 
~ 80c, Class Inv. .226*|.263*|— .220*|.004 |.051 |—.170|—.119 
- Att, to Church “142 |—.038 |.281*).011 |—.033 | .031 
Bex .288*|.172t|.199*| .001 | .080 

Ago ‘006 |.132 | .053 | .060 

Church Attend. 120 |—.173ł| 180} 

Home Discip. —.055 | .052 

Polit. Pref, .122 


* Significant at the 1 per cent level. 
t Significant at the five per cent level. 


Tastin 2.— MULTIPLE Corrstations or ALL Varrapus WITH SAS 


i R 1.23456789 3 
Protestants (N = 145) 323* 075 
Catholics (N = 27) 694* 102 
Total (N = 180) 330* 067 
* Significant at the one per cent level. X € 
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Taste 3.—Means or Att VARIABLES AND STANDARD DEVIATIONS oF 
CONTINUOUS VARIABLES 


| sas | scx sat | Ase | Sex [church | tome | pol | ma 


Protestant (N = 145) 


9.33 -26 | 1.26 | .61 | .70 | .04 


1.77 


34.06 
7.14 


“y Mean ”|—16.85 | 19.54 
sD 21,54 | 4.83 


Catholic (N = 27) 


9.85 | 9.44 | 33.63 | .37 
21.65 | 5.42 


1,86 | 6.81 
_ Total (N = 180) 
Mean |—17.09*) 19.56 | 9.26 | 83.86 | .26 | 1.34 | .59 | .72 | .06 
SD 22.08 | 4.91 | 1.85 | 7.04 


* Jewish group pulled mean for total sample in a negative direction. 


TABLE 4.— SIGNIFICANCE OF DIFFERENCES BETWEEN LINEAR CORRELATIONS 
AND Pras FOR CHURCH Arrenpance vs. Four 
Non-copep VARIABLES 


Linear r Eta 


eee 
Protestants (N = 145) 
Church vs. SAS .111 .209 >.05 
Church vs. SAI .287 .289 >.50 
Church vs. SCI .025 .187 >.05 
Church vs. AGE —.023 —.107 >.30 
Catholics (N = 27) 
Church vs. SAS 437 ATT > 30 
Church vs. SAT 099 .215 >.50 
Church vs. SCI —.229 —.251 ZA 
Church vs. AGE .087 .136 >b 
Total (N = 180) a 
Church ys. SAS .193 248 FR 
Church vs. SAI 281 |* 200 iti 
Church vs. SCI -004 -146 ar 
Church vs. AGE -006 .079 Za 
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ce derived from the statistical analysis points in 

ection. Regression equations for Catholics and Prot- 

re computed from the correlation matrices. It was 

the most heavily weighted factor for predicting F-test 

the Catholic sample was frequency of church attendance. 

heavily weighted factor for the Protestant sample was 

toward the church and frequency of church atten 

‘ourth and seventh, respectively, in this regard. This 

substantiation of Struening’s finding that Catholics 

er than Protestants in scores on the F-test. ` 

are three possible explanations for these inde 

‘sons who are authoritarian may find that some of their 
satisfied by affiliation with the church. The second 

might be that the education one receives through the 

the church is authoritarian in character, and that this 

carries over into other areas of life. Third, there may 

dependent third factor which influences both authoritar- 

es and attitudes toward or membership in the church. 
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HOSTILITY: AN EDUCATIONAL PARADOX : 


HARRY A. GRACE 


Michigan State College! 


Investigators of hostility have been primarily concerned with 

ejudiced people, anti-minority feelings, or ethnocentrism. This 
bias of investigation has precluded the possibility of analyzing 
hostility per se, devoid of special targets. We are concerned with 
tern of hostility’, regardless of the particular target. Once 
weeconstruct a systematic theory of hostility clarification of 
target specificity will readily follow. We are not interested in 
which target is most or least liked, but, given the rank order of 
9 any group of targets, how hostility is patterned. 

We postulate that action is dependent upon intellectual and 
emotional forces. Rather than try to test such a broad thesis, we 
concentrate upon the hostile act as dependent upon both informa- 
tion and attitude. Should this be systematically demonstrated, 
social behavior theory may be anchored in a general theory of be 
havior, This paper presents data which cross-validate our hypoth- 
esis. 

Instruments were developed to measure subjects’ verbal reac- 
tions to various catastrophes in terms of the amount of informa- 
tion they felt they had about them, their attitude toward them, 
and the amount they felt threatened by them. Empirical results 
of hostile attitudes toward nations indicated that one has to feel 
one has some knowledge about objects in order to be hostile 
toward them. (1) 


METHOD 


The instruments prepared for this study offer a method of ne 
construction which is easily administered and has high reliability 
Modified paired comparison and Likert scaling techniques F 
combined to yield a flexible instrument which measures A, 
direction and intensity of attitude. The test may be manki 
scored, It may be applied to preference inventories, atiti 
scales, or sociometrics. 


O 1 Milton R. Canfield, Iowa State College, and James F. Guyot, Michiga? 
State ege, coöperated on this study. 


A = 
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Mity apni students at, the University of 


abl familiarity, ete. i were: pile 

abetes, heart attack, leprosy, nervous breakdown, 
poliomyelitis, syphilis, traffic accident, and ae 

r pair in each inventory the subject marked a peo 
Catastrophe (or indifference) to complete < the 

statements: ) 
dislike for . . . . than. 

accurate ihiowledgs and information about . . . . than . 

hreatened by and fearful of .... than... 
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An item might look like this: 
ANEMIA more about equal more CANCER 


The subject marked his choice on an IBM answer sheet. Each — 
catastrophe was scored separately. Two points were given for 


every ‘more’ answer and one point for an ‘about equal’ answer. 


RESULTS 


intensity scores for each stimulus were plotted on the 
to check linearity. The dislike and informa- 
ìon, stimuli were both linear, but the threat was curvilinear. 

Stimuli were ranked in order of dislike for each subject. Rank 
one might include several stimuli, for different subjects selected 
various catastrophes as most disliked. The information scores for 
the attitude stimuli classified on the basis of ranks of dislike were 
determined and plotted. The hostility (threat) scores for these 
stimuli were similarly determined and plotted. The smooth curves 
are shown in Figure 1. 

The data indicate a curvilinear relationship between assumed 

information and dislike. Threat is directly, linearly related to 
dislike. 


DISCUSSION 


The data on threat and dislike differ somewhat from data on 
hostility and liking. Curvilinearity appears in both cases, but the 
direction of the curve is reversed. This difference may be attributa- 
ble to the different items (nations or catastrophes), or to different 
instructions (hostility or threat, like or dislike). We favor the 
latter explanation. It is corroborated by the sociometric analyses 
in which one finds differences in the distributions of positive 4m 
negative choices. Liking may not be the antonym of disliking, n0" 
hostility of threat. 

Another explanation is that these differences represent ual 
patterns of hostility. Let us assume they are. Then, if it 8 ie 
task to educate persons with regard to a set of stimuli, it is entire” 
possible that our program may have serious effects upon hostility- 
The more information one feels he receives about things he likes, 
the less hostile he may become; the more that he thinks he know 
about things he dislikes, the more hostile he may become. ooi 
versely, we feel we have greater information about stimuli whic 


D 
p ® 
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consider moderately threatening to us, those which we feel do 
threaten us and those which threaten us severely we feel less 

] about. In either case the effect of additional information 

| not be linear. 

is may explain some of the boomerang effects in our educa- 

al efforts. Information behaves like so much ammunjtion when 

r dislikes are strong. When we are under threat we may not ge 
perceive additional information. Before we can predict the Out- 
of a program of hostility or threat reduction yw ; first 
tain the pattern upon which our efforts of fabe 

erted. Otherwise we are apt to increase the feelings of hogtility À 
and not affect threat in the least. The paradigm of hostility has ` 
a wide range of basic and applied research possibilities and offers 
ition to a practical paradox in education. Guiding hypoth- 
= eses for future research are: 

1) Hostility (threat) is dependent upon the relationship between 
an individual’s attitude toward a class of stimuli and the amount 
‘of information he feels he has about it. 4 
The pattern of hostility may be changed by altering either 
ude or amount of information, or both of these variables. 

) The ease and significance of changing a pattern of hostility 


depend upon the pattern held and its generality to other stimuli. 


SUMMARY 


_ The genesis of a systematic theory of hostile behavior is pre- 
ented. Hostility is considered as the resultant of attitude and 
"assumed information. A method for the study of hostility patterns 
is described. Results indicate that the relationship between atti- 
tude and assumed information is curvilinear, The educational 
Implications of this curvilinearity are discussed. 
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BOOK REVIEWS 


Jerome M. Sztpman. Editor. The Adolescent: A Book of Readings. 
New York: The Dryden Press, 1953, pp. 798. $4.50. 


Sixty-seyen selections, in six parts and twenty-two ‘chapters’ 
or may not be worth four dollars-and-a-half of an under- 
uates money. Aside from its binding and paper (which in this 

hi verage) the value of a book lies in its content. In 


; Cöntent depends upon the care and wisdom with 
fich the editor has sampled the field, ‘and upon the.degree of 


all “integration he is able to achieve with his comments and organiza- 


tion. Finally, the sample of content depends upon the population 
of content. 

Perhaps it is this last point which is the key to Scidman’s The 
Adolescent: A Book of Readings. Every discipline which deals with 
human beings might touch upon the particular class of them which 
is called ‘adolescents’. In fact, Dr. Seidman has included—as a 
kind of horrible sample—one short piece which is from no schol- 
arly discipline at all, but which belongs, rather, to the profession 
of journalism. It is called “How To Be a Woman,” and “although 
the reader may derive some insight from it, he will, of course, 
recognize that this article is written on a popular rather than a 
scientific level.” To criticize adversely on the basis of these three 
pages would be unfair except that they illustrate what seems to 
this reviewer an excessively catholic taste. Anthropometry, an- 
thropology, education, physiology, psychoanalysis, psychology and 
sociology are all represented. The selections range from the severely 
empirical and methodological paper of Gough and Peterson 
(“The Identification and Measurement of Predispositional Factors 
in Crime and Delinquency”) through a labored excerpt from 
Lewin that strains to describe the problems of this period in 
field-theoretical terms, to the provocative but very interpretive 
observations of Margaret Mead and Caroline Zachry. In short: 
the sample is too broad and too shallow. Too broad, because most 
undergraduates will be unable to put many of the unrelated 
selections into any perspective at all. Too shallow, because 80 
much breadth has been attempted that not enough information 
about any single problem can be included. 


Some i this difficulty results from a population of writings 
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in terms of subjects rather than subject matter. The im- 
that adolescents are a breed apart, which is doubtful. 
her hand their parents may well agree, and in any event 
es of many colleges and universities make this con- 
to student interest. But there is then no reason to include 
which deal with quite different topics and ages. Among.’ 
Oscar Lewis’ “Husbands and Wives in a Mexican Ji- 
Gordon Allport’s “The Resolution of Intergroup Tensions,” 
ay and MacKinnon’s “Assessment of QSS ou 
foregoing general, or principled, object 5 Mec 
which is the occupational hazard of the anthologist® wh¥ 
cular choice and the omission of another? Individual 
es of opinion of this sort combined with the fundamental 
ity in the scope of the collection will cause many an in- 
to think long and hard before asking his students to buy 

ctor would assign all of it, but any instructor will find 
some things he would like all of his students to read. 
many will find enough to justify the addition to their 
s’ financial load is hard to say. 


n 
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Joun P. McKen 
versity of California 


E. Oscoop. Method and Theory in Experimental Psy- 
y. New York: Oxford University Press, 1953, pp. 800. 


lowing the large Handbook of Murchison (1934) and the 
more recent volume edited by Stevens, we have here another 
tal and detailed discussion of experimental psychology. 
or states that his principal purpose is to provide a “text 
luates experimental literature in close relation to critical 
ical issues.” The attempt has been “to provide an inter- 
Tramework that both brings out the significance of existing 
and directs the reader’s thinking toward as yet unsolved 
hat need further research.” It is presented both as a 
as a book of reference. “Although the coverage is exten- 
‘also selective.” To overcome “the potential disadvantage 
le authorship,” the author submitted many chapters “to 
more experimentalists who are experts in the fields con- (ġ 
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There are four parts into which the subject matter has been 
divided. Part 1, Sensory Processes, includes, sensory quality, 
sensory intensity, audition and vision; Part 2, Perceptual Pro- 
cesses, includes, the nature of perceptual organization, projection 
dynamics in perception, and central dynamics in perception; 

art 3, L@arning, deals with fundamental operations in learning, 
Bice of learning, certain controversial issues in learning theory, 

neurophysiology of learning, serial and transfer phenomena, and 

rete symbolic Processes, discusses problem-solving 
ght, g, and language behavior. 
Ac®ording to the preface this work is the result of about ten 
years of effort given primarily to this project. It has the unity 
that the thinking of a single individual can give to it. If one looks 
for the report of a certain experiment it may not be found. But on 
the whole, critical judgment must recognize the magnitude of the 
contribution in which so much of experimental results and theo- 
retical considerations have been collected and interwoven into a 
well-organized presentation. Stress was made upon those studies 
that seemed to be most pertinent to the theoretical issues. In 
general the selection has been excellent and the advanced student 
of psychology will find a thoroughly valuable contribution to the 
understanding of the subject in its present stage of development. 

The critical student will perhaps be helped to see the generally 
unsatisfactory nature of theoretical explanations as they exist 
today, and to realize the need for much more fundamental re- 
search for the best progress of the science. Not only this, but he 
may find definite suggestions for the planning of much needed 
research, 

One of the valuable contributions of the volume lies in the 
inclusion of certain material not before brought together in a 
single book, such as synaesthesia, certain material in connection 
with problem-solving and transfer, and in relation to language 
behavior and communication. £ 

Students of educational psychology should find valuable contri- 
bution to their thinking especially in connection with the discus- 
sions of learning, transfer, problem-solving, thinking and language 
behavior. They might also be helped to recognize the fact that 
theoretical explanation based on insufficient experimental data 
» has been inadequate in psychology and too often carried over 

int tisfactory educational theory. And, perhaps again, they 
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d to see that theory which conflicts with physiological 
sical facts already known, is to be very carefully avoided. 
will be found to be especially valuable for reference 
bibliography and author index contains 1290’citations. 
index is well made and extensive, pages 781-800. A 
mber of figures and graphs add to the interest Gnd clarit 
entation. 
e of the decidedly valuable contributions to 


scholarly, generally clear in stators p 
omplicated and difficult definitions. 


chol- 


within psychology, this contribution may help them 
psychology has been developing out of the ‘psychologies’ 
it, as in other sciences, the good will be sifted out from the 
that a sound psychology will be established and will pre- 
the part-psychologies and the pseudo-psychologies that 
ve been inevitable outbreaks in the development of the 


author and publisher are to be congratulated. 

A. 8. EDWARDS 
he University of Georgia 

ERT LINDNER. Prescription For Rebellion. New York: Rinehart 
and Company, Inc., 1952, pp- 305. $3.50. 


improve mankind is our goal, what the world needs is to 
n the fallacy of adjustment and cultivate instead an appre- 
for the indispensability of its alternative—positive rebel- 
a pattern and way of life. This, in essence, is the theme of 
3s Prescription for Rebellion. Admittedly, the volume 18 


ed to sound an alarm. The conten 


t of the volume he presents 
Diagnosis, is composed of 

i Binds; Shamans 
Rebel’s Fate; 


sion; Adjustment—Causes and Conse C 
nist. Book Two is called Prescription and inclu 
rs, each called The Revolutionist’s RE 
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diagnostic chapters contain more startling and surprising subject 
matter and interpretations. The chapters on prescription are mild 
by comparison. 

The book, advises the author, represents a feeble attempt on 

his part to write a prescription for living for use by all. No one 

reading tae book will consider it a feeble attempt. The book is 

eter in strong language and it is more than outspoken—it is 

often gondemnatory of many of the things considered to be proper 

nov ield, and is written with seeming aim to disturb 

ave Eito action and get them tense enough to express 

terest in doing something about it. A cult of passivity and 

surrender is exactly what he is fighting against and he fights 

openly and strongly. For the first many chapters, the book sounds 

> more like that written by a prophet of doom or the chronicler of 

psychological disaster than one who is making a feeble attempt. 

The very first sentence of the prologue is illustrative. It reads 

“the plight of modern man is pitiful . . . Now, with this knowledge 

in his grasp, he finds it being misunderstood or conscientiously 

misapplied. If ever there was cosmic tragedy, this is it.” This 

error of prescribing for adjustment, in his words, amounts to a 

literal perversion of a most important, if not the most important, 

area of knowledge to an instrument for the continued debasement, 
exploitation and enslavement of man. 

Antagonists with good professional status, he has many. He 
particularly picks on Franz Alexander for what he considers to be 
his emphasis in therapy, namely, domestication; Karen Horney 
for representing only the upholding of feminine psychology; 
Harry Stack Sullivan for seeming to be too satisfied with happiness 
based on conformity and, in that sense, being an advocate of 
reaction; Carl Rogers, whose non-directive psychotherapy he 
refers to as a “potpourri of ventilation, reassurance, insight and 
will therapy.” He is even more vehemently opposed, and spends 
more time considering with hostility, what he refers to as the 
‘shockiatrists’. He is also critical of psychosurgery and its undis- 
criminating use, particularly. But he does have some protagonists. 
He reacts favorably to Abraham Myerson, Bertrand Russell, 
Norbert Weiner, Roland S. McCullough. And, after damning 
many psychoanalysts, he comes out with the conclusion that the 

N most effective of all psychotherapeutic tools for accomplishment 
of ti of treatment is psychoanalysis. But he adds that the 


Tes. 
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assume the responsibility for the philosophy with 

nt leaves his analysis, and that the philosophy 
t of rebellion on which the fundamental premise 
the last three attributes of the rebellious person: 
employment and tension. 
tment is not necessarily a criterion of a goc life, a 
n personality or integrity, is not exactly a new concep 4 
of mental hygiene. William Healy, psychiatrist, Clif ‘ol 


st, and other students of delinga 
bo srehpeasuieh 


ed between integration as agains 


y means conforming to delinquency. It probably is a 
too often in the more academic texts, adjustment has « 
d in a manner which justified criticism, but whether Q 
s this kind of alarming consideration is a question mark. 
is to some extent justified; the means are extremely 
But, regardless, this is an interesting book to read, 
es the attitude that is deserving of some attention, 
t is expressed in over-generalized, exaggerated, and 
ionary terms. As such, it can make for valuable 
f taken as an expression of protest rather than as a state- 


H. MELTZER 


THITAKER AND Tuomas P. Maront. The Roots of Psycho- 
. New York: The Blakiston Company, 1953, pp. 236. 


Roots of Psychotherapy two professors from Emory Uni- 
ment of psychiatry, one a Ph.D. and one an M.D, 
attempt to develop a frame of reference to serve as a basis 
g features of the treatment process which are common 
of 
psychotherapy has become more and 
e dealing with a recognizable process. The content of the 

nted in three large sections: Foundation, Process, and © 
The section on Foundation considers the b plogical 

pret 


on 
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basis of psychotherapy, implicit and explicit psychotherapy, and 
dynamic psychiatry and psychotherapy. The section on Process in- 
cludes chapters on the patient as a person, the process of psycho- 
therapy, anxiety and psychotherapy, the therapist as a person, 
patient-vectors in the therapist, and restatement of the problem of 
psychotherapy. In the section on Techniques are presented some 

ghs techniques found useful by the authors. The book is intended 


aevnors express in the preface is that such rah 
heild Ee little difficulty comprehending the essential meaning 
of the book. The book contains a glossary of terms gs well as a 
bibliography, but does not contain an index. g 

$ Of interest to clinicians will be the chapter on some techniques 
in brief psychotherapy. Here will be found a brief consideration of 
the various stages of therapy—beginning phase, symbolic phase, 
and ending phase. Here too they will find a consideration of tech- 
niques of administration in therapy. Technique is defined by them 
as “an interpersonal operation deliberately used by the therapist, 
the function of which is to transpose social, latent affect in both 
participants into deeper, manifest affect in order to catalyze the 
affective and symbolic process of psychotherapy.” This may be 
a good definition, but not easily understood by a layreader. Pre- 
sented here also are techniques for expanding the fantasy compo- 
nent of therapy, this on the contention that adult oriented fantasy 
frameworks are less significant as methods of communication than 
are the symbolic forms which the culture provides. Also consid- 
ered in brief are pitfalls of psychotherapy. 

The authors show a ready familiarity with not only psychiatric 
and psychological literature but also related philosophical litera- 
ture. Illustrative references include the writings of Rank, Ruesch, 
Schilder, H. S. Sullivan, Kubie; Rogers, Lewin, and Piaget; and 
Einstein, Morris Cohen, Bridgeman, and Whitehead. Sane and pro- 
fessionally sound the book is. An honest attempt to build up and 
develop a frame of reference for consideration of therapy for prat- 
tice and in research purposes it also is. And valuable to professional 
therapists it can be. For popular purposes the writing just isn’t 
clear enough, particularly in the early chapters. 
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oyp. Educating the Sub-normal Child. New York: Phil- 
Library, 1953, pp. 148. $3.75. 
WNSEND. Introduction to Experimental Method. New 
McGraw-Hill Book Company, 1953, pp. 220. $4.00. 


two books interested on very different levelf in or 


known to the human mind; namely; Yea 
sooks have significant agreement in one of the 
f teaching, that of getting the learner to participate 


Foreword maintains is “an ordinary school and not a 
of the type to which authorities like to direct visi- 
v). 
information and discussions include the problem of the cri- 
which E. S. N. (educationally sub-normal) children 
selected for special school treatment; social background 
des of parents; social training in the special school; 
the special E. S. N. school; activities in the special 
school; what success means in this kind of school; and 


ly new reported; but rather in the brief story of what 


fe now in many places considering the organization of such 
or classes, and who desire to know the judgment of one 
had long experience in the work. 
midst of what seems to be useful information and good 
onal judgment, there are amazing statements about 1Q’s. 
‘of one of the cases was reported as 42 at the age of seven, 
at the age of twelve. (p. 81). And a ‘doctor’ was reported 
as suggesting that under proper conditions, the IQ might 
other twenty points! One wonders what psychological ex- 
n was used; what was the competence of the examiner; 
icularly, was the child in proper condition when he was © 


i i i curate megis] age 
so that it was possible to obtain an accurate Lase 
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and intelligence quotient? Especially, we should like to know if the 
author means to say that the methods of the school actually 
raised an imbecile so near to the intelligence of a normal child. 
Tf this can be done, educators will want to know a great deal more 
about the methods used in this school. 
A u@ful index completes the book. 

"1 Quite as important, and for very different reasons, is the volume 

of Lownsend.,.aegetontribution is not to teaching in one or a few 


Sn all fields of science. Its purpose is to demon- 
JOE Sy from ‘cook-book’ methods, away from mêre 
following of instructions, to insight and actualfarticipation in the . 
performing of scientific experiments. i 

“The emphasis in this book is placed on preparing’ >e student 
to think along lines of the development of sound reseat@ai designs 
rather than the successful execution of standardized traditional 
experiments, In this way, it is believed that the beginning student 
may be exposed to the elements of theory and method necessary 
for the understanding of the performance of experiments. Equally 
important, the student will be given opportunity to become sensi- 
tive to the ways of critical thinking early in his contact with sci- 
ence and not be forced to labor in darkness and boredom until he 
becomes a graduate student.” (p. vii). 

The main parts of the book include theory of experimentation; 
design and conduct of experiments; interpretations and conclu- 
sions; and applications of the experimental method. 

In such a book there is the very difficult problem of selection of 
material, especially, since it is a relatively new kind of book. In 
its ensemble it is new. In detail it considers scientific versus non- 
scientific procedure; the locating and simplifying of problems; . 
hypotheses and variables; control of the experiment and use of 
apparatus; the conducting of the experiment; the use of logic and 
of statistics; the making of the report including the making of 
tables, but not enough about the making of tables; pictures and 
brief descriptions of apparatus; and the appendix includes four of | 
the most commonly used statistical tables. 

It is easy to point out details which one would suggest for 
inclusion in the volume. The reviewer is inclined to think that the 
book would be more interesting and useful to the student if there 
9) were moyeroncrete psychological subject matter to balance the 

amoj di aisrtract and methodological material. From one point © 
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more useful to the instructor than to the student 
tructor already has the concrete material on the 
the logic and methodology can be thought out. 
can supply the concrete material and may do so 
plementary text. Except for some mention of ap- 
practically nothing in the book relating toranimal 


by Townsend could vastly improve the 
ce in all departments of colleges and universities; 
simple application could be made to improve 
ig in the high schools. But it will need more concrete 


concludes with a useful index, pp. 215-220. 
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ie 


fer of training could be discovered through a statistically 
d experiment. Transfer for this study was defined as the 
application of skill or knowledge in adjusting to a new 
. The amount of transfer to the new situation was meas- 
the difference between the actual performance and the ex- 
Performance. The actual performance was the attainment 
ill or knowledge to be transferred had been learned, The 
‘performance was what the attainment would have been 
skill or knowledge to be transferred not been learned. 
the purpose of delimiting the scope of the investigation, 
of training was restricted to the successful application of 
ations only. Sex differences in the amount of transfer were 
£ by comparing scores obtained in the new situation with 
predicted on the basis of knowledge acquired previous to 


technique used in this investigation differed from the cur- 
popular techniques for measuring sex differences in two re- 
first, this technique used a statistical control on previous 
ge; second, while the prevalent techniques for measuring 
ferences used neuter items in preference to biased items, the 
‘investigator used biased items in preference to neuter 


se intelligence and transfer ‘of training are intimately re- 
the present writer thought it advisable to examine experi- 
on sex differences in intelligence. The results of this examina- 
Tevealed the general belief that there is no sẹ ence in 
lligence. However, the writer feels that this be 
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upon sound evidence. The prevalent intelligence tests are invalid 
when used for the measurement of sex differences in intelligence 
because of the nature of the items. The accepted procedure for 
selecting items is by the method of ‘fairness to boys and girls’, 
This method results in items, and in turn tests, that do not and 
cannot disclose sex differences in total scores. Surely such neuter 
items are’hot appropriate for detecting sex differences. 

As sex differenc; s intelligence have not been accurately meas- 


and ‘Do boys transfer more than girls?’ Each problem :®p: 
special experimental design with different source materials.” As a 
preparation for the selection of the source materials, numerous sur- 
veys on reading habits and reading preferences were consulted. 
These surveys indicated that boys prefer to read and actually do 
read more about Science; whereas girls prefer to read and actually 
do read more Home Economics. Accordingly, Home Economics 
material was used as source material for investigating whether 
boys transfer more than girls. Should boys be found to obtain 
higher scores on Home Economics, the higher scores could not be 
explained away on the basis of sex differences in previous 
knowledge. 

The writer investigated transfer of training under two different 
conditions when hints are given and when hints are not given. A 
total of three hundred boys and three hundred girls drawn from 
Quincy, Massachusetts; Boise, Idaho; and Charleston, South 
Carolina were used. 

The design for investigating whether boys transfer more than 
girls when no hints are given contained three types of items: (1) 
deduction items, (2) information items, (3) items on principles. The 
deduction items were used as a measure of transfer of training. The 
information items were used as a check on whether the deduction 
items were biased in fayor of one sex. The items on the principles 
were used as a measure of previous knowledge. $ 

The items were selected in accordance with the following specifi- 


=. 
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iaterial. Items selected for the second section were based 
Economies material. 

‘Home Economics items were presented to several females 
as to whether or not the items were as fair to girls as 
'airness in this instance was based upon interest, utility 
pation rather than upon obtained scores. 


the 


SHfems were written in clear, concise language that 
understood by pupils several grades below the eleventh. 

rity was assured for the test group which was composed 
selected from the eleventh grade. 


SAMPLES OF ITEMS FROM SCIENCE MATERIAL 
Information Item 


tends to make air expand.........-csessseneeeneeteereneres THE 
Principle 

air is a good conductor of heat.....--++++eresreeerenteees ie D y 
n Items 


id day, a loose-fitting glove will keep your hand warmer than 


a E a a E bres TF. 
ating material is made loose and fluffy so it will be easier to a 


t is lost through hollow buildi; 
e “till air” does not conduct heat as well as cement does. | 


S OF ITEMS FROM HOME ECONOMICS MATERIAL 


l Information Item 

Is a rule, bluing is used to make clothes appear blue. ...---+-:+ Te 
ns on Principle 

of cloth is weakened by twisting and bending......+++++ 


ing clothes will wear them out...-.--::ssr-7rsrs TT 
1 Information given between pretest and post-test 
Wearing qualities of textiles is sometimes tested # 
of times the fiber of the cloth can be bent ay f i 
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The test was administered to over twelve hundred eleventh- 
grade pupils. From this group, six hundred test papers were se- 
lected for the final statistical analysis. These papers were selected 
on the basis of age and type of high-school program. Test papers of 
pupils between the ages of fifteen and seventeen, and a propor- 
tionate number of test papers belonging to college and non-college 


7 deduetion items, if å stadia. S æ4ributed to sex 


However, reading comprehension is only one of mattygeompo- 
nents that might account for sex differences in the scores On deduc- 
tion items. In order to be reasonably certain that sex differences in * 
scores on deduction items indicated sex differences in transfer of 
training, other possible explanations for sex differences in the 
scores must be eliminated or at least minimized. Many possible 
explanations for sex differences may be eliminated by determining 
whether or not the boys and girls used in this study were unique in 
a way that would invalidate the general conclusion that boys trans- 
fer more than girls. 

How unique is the sample group of boys and girls? This question 
was investigated by comparing boys and girls on the basis of stand- 
ardized tests, of teachers’ marks on school subjects and teachers’ 
ratings of pupils’ personality. 

The pupils in Quincy took the Pintner Intermediate Intelligence 
Test when in the tenth grade, and the mean IQ score for the girls 
(112.5) was found to be slightly higher than the mean score for the 
boys (110.0). Hence, any superiority of boys scores on deduction 
items could not reasonably be attributed to superiority in intelli- 
gence. 

On examining school marks of the experimental group, the girls 
were found to obtain slightly higher marks than the boys in both 
Mathematics and Science. However, teachers’ marks on school sub- 
jects ‘are known to show a ‘halo effect? in favor of girls. This ‘halo 
effect’ may be partly due to sex differences in personality. 

<ageiaing the personality ratings, the girls were found to 
higher rating on ‘initiative’ and a significantly 
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on ‘applicability’. Thus, if boys received higher scores 
items, the higher scores could not be explained away 
of personality traits such as initiative or applicability. 
on the Arithmetic Reasoning test did boys obtain higher 
es than girls and, even then, the difference between the 
as not significant. It may well be that Arithmetic Reason- 
‘elements of deduction and transfer. @ 


mique in a way that would invali% 
ansfer more than girls. j 
apting po 


1d “Who transfer more when helpful suggestions are not 
boys or girls?” This breakdown makes it possible to isolate r 
ect suggestion may have on test scores. To account for sex 
es in reading preferences, reading habits, life goals and 
s knowledge, the two questions were further broken down 


o girls transfer more Science material than boys when help- 
stions are not given? 

}) Do boys transfer more Home Economies material than girls 

elpful suggestions are not given? 

o girls transfer more Science material than boys when 

Suggestions are given? 

Do boys transfer more Home Economics material than girls 

n helpful suggestions are given? 


le 1 is the design for investigating the question, “Do girls 
more Science material than boys when helpful suggestions 
not given?” The schema is paralleled by the actual statistical 


Do girls transfer more Science material than boys when helpful 
ions are not given? The answer seems to be ‘No’; at least, 
requirements of the proof were not fulfilled. The first require- 

that girls should not obtain higher scores on Science informa- 
knowledge of the specific type of Science material used in the 


second requirement that girls do not obtain yee oreson Œ 
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Taste 1—Do Grats Transrer More Tuan Boys? 


Tests on Requirements for proof 


Actual results 


1) Information items | Girls should not obtain] Girls did not obtain 
(Science material) higher scores higher scores P = 

-00,00+- 

2) Items o% principles | Girls should not obtain] Girls did not obtain 

3 higher scores P = 

.0035 

did not obtain 


Akdladia-P- SF 
principles pertinent to deduction items was also fulfilled. The find- 
ings support the view that the boys’ scores on science principles 
were significantly higher than the girls’ scores. Therefore, if girls 
did obtain higher scores on deduction items than boys, the higher 
scores could not reasonably be attributed to sex differences in 
previous knowledge. 

The third and crucial requirement that girls obtain higher scores 
on deduction items on Science was not fulfilled. The data indicate 
that the boys obtained significantly higher scores on Science 
items (P = .00,000) and, therefore, these results can not support 
the view that girls transfer more than boys. 


‘Do boys transfer more Home Economics material than girls 
when helpful suggestions are not given?” may be investigated with 
a new design. Table 2 is the schema with the actual statistical 
findings. 

Do boys transfer more Home Economics material than girls 
when helpful suggestions are not given? The answer is ‘Yes’. All 
three requirements were fulfilled. The data support the view that 
girls have more knowledge of the specific type of material used in 
the test. Hence, any superiority boys may have over girls on de- 
duction items could not reasonably be attributed to a superiority 
in previous knowledge of Home Economics. 

The second requirement that boys do not obtain higher scores 
on principles pertinent to deduction items was also fulfilled. The 
data support this view. Therefore, any superiority of boys’ scores 

j À on deduction items could not reasonably be 
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Taste 2.—Do Bors Transrer More Tuan GIRLS? 


Requirements for proof Actual results 


Boys should not obtain| Boys did not obtain 


ion items 


Economics higher scores higher scores P = 
) .00,000 
on principles Boys should not obtain) Boys did @aot obtain 
Economics higher scores higher scores. Gi 


“go slightly 
Ate Ap = 30 


to a superiority in knowledge of principles useful in 
out the deduction items. 

third and crucial requirement was that boys obtain signifi- 
higher scores on deduction items drawn from Home Eco- 
material when helpful suggestions are not given. This 
nent was fulfilled at a level of significance of P = .00,000. 
ee requirements for proving that boys transfer more than 
fulfilled. The findings, therefore, support the statement 
transfer more than girls when helpful suggestions are not 


n helpful suggestions are given; do girls transfer more in 
material?” may be investigated by the design in Table 3. 


8—Griven HerrruL Svaczstions, Do GIRLS TRANSFER Mors 
Tuan Bors? 


‘Tests on Requirements Results 
a AS ES i 
nation items | Girls should not gain | Girls did gain slightly 
ce material) more than boys more than boys 


-tèst followed by 
test R 
uction items (Sci-| Girls should obtain Girls did not obtain 

material) Pre- higher adjusted higher adjusted post- 
followed by help-| post-test scores test scores 
suggestions and 
jest 
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Do girls transfer more Science material when helpful sugges- 
tions are given? The answer is ‘No’. Neither of the requirements 


was fulfilled. The purpose of the first requirement was to determine , 


whether or not girls gain more than boys by the ‘practice effect’. 
The practice effect was measured by the difference between the 
pre-test and the post-test scores on the information items. In this 
experimei#, the girls were found to gain slightly more than the 
I by the ‘practice effect’ for both Science material and Home 
ics material. The girls’ gain for Science material was 1.35 
hile the boys’ gain was 0. ‘ma For Home Economics 
rial the girls’ gain- e boys’ gain was 
.23 points. The first ouient was PA | rily met. 
Neither was the second and crucial requirement satisfactorily 
met. After helpful suggestions were given afid after allowances 
were made for the effect of previous knowledge of the deduction 
items, the boys were found to obtain significantly higher scores on 
deduction items on Science P = .0014. ‘Allowances’ were made by 
discounting the effect of pre-test score upon post-test score through 
the analysis of covariance. 


Eco! 
poi 


“When helpful suggestions are given, do boys transfer more in 
Home Economics material?” may be investigated by the design in 
Table 4. 

Do boys transfer more in Home Economics material when help- 
ful suggestions are given? The answer is ‘Yes’. Both requirements 


were fulfilled. : 
Taste 4.—Given HeLrruL Succxstions, Do Boys TRANSFER Mors 
Tuan Gris? 
Tests on Requirements Results 
vA i ESE 
1) Information items | Boys should not gain | Boys did not gain 
(Home Economics) | more than girls more than girls 
Pre-test followed by 
post-test 


2) Deduction items | Boys should obtain | Boys did obtain higher 
(Home Economics) | higher adjusted | adjusted post-test 
Pre-test followed by post-test scores scores P = .00,000 
helpful suggestions 
and post-test 


3 
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were found to gain slightly more than boys by the ‘practice 
. Evidence for this statement has been given in the discussion 
third question, and therefore the first requirement of the 
has been fulfilled. * 
nee with a P = .00,00 support the view that boys obtain 
adjusted scores on the deduction items on Home Economics 
when helpful suggestions are given. © 
fence, with both requirements fulfilled, it may be stai fn 
joys transfer more than girls when helpful suggestions 
By synthesizing the above conclusion that boys transfer nf 
girls when helpful, suggestions EER with the previously 
sion that boys transfer more than girls when helpfuksugg' 
sare not given, one is led to the general conclusion that boys 
ally transfer more than girls. 
e conclusion that boys generally transfer more than girls is 
based on the fact that boys obtained significantly higher scores on 
luction items in Home Economics material before suggestions 
given, and also after helpful suggestions were given. 
he fact that boys obtained significantly higher scores on de- 
ion items in Home Economics could riot be attributed to a 
sriority in intelligence, because the boys did not receive higher 
gence test scores. 

The higher scores of the boys could not be due to the fact that 
there was a superiority in ‘applicability’ and ‘initiative’, as the 
s had received significantly higher ratings from their teachers 
the personality trait labeled ‘applicability’ and ‘initiative’. 

f The higher scores that boys received could not be explained on 
basis of reading preference and reading activities because sur- 
s have shown that girls prefer to read and actually do read 
about Home Economics than boys. 
he boys’ higher scores on deduction items could not be ex- 
ed on the basis of previous knowledge because the scores of 
boys were lower than those of the girls on information drawn 
m Home Economics material. i 
"The difference in the scores could not be explained on the basis 
knowledge of the principles pertinent to the deduction items. 
e girls’ mean score on the principles was slightly higher than the 
mean score. 
The boys’ greater gain in scores from pre-test to post-test on 
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deduction items cannot be attributed to superiority in the practice 

effect. Girls were found to gain slightly more than boys by the 

practice effect. ’ 5 

A superiority in the scores could not be attributed to a sex differ- 

ence in reading comprehension because the boys and girls used in 

the investigation were experimentally matched on the basis of 
—eqding comiprehension. 
, it may be said that since the boys’ superior mean 
wh the deduction items is not due to intelligence, to previous , 
Tige, to reading comprehensjon, to reading preferences or 
activities, to personality traits such as applicability and 
itiative, to practice effect, or knowledge of pertinent principles, 
it is reasonable to conclude that the higher scores on the deduction 
items are due to boys’ superior ability to transfer. 


. 


PTION IN NUMBER SKILLS—A STUDY 
IN TACHISTOSCOPIC TRAINING’ 


JOHN L. PHILLIPS, JR. 


sal of the literature on tachistoscopic training iscloses 
at appears to the writer as a single basic issue, with two major 
growing out of it. The basic issue is between tw 
y, called in the psychology of reading, ‘peripheral’ 
al’, (These terms can be extended to refer to any skill fhyolv’ 
use of the higher mental processes). The two problems 
g out it are (1) can peripheral training improve performance 
skills? and (2) can central training improve performance 
skills? . 

answer to the latter question is so obviously affirmative as 
lude any extended discussion. No one would question the 
ity of concept formation as a prerequisite to reading, for ex- 
, or to recognition of aircraft, or to picking out the imperfect 
on an assembly line. The other proposition is not so apodictic; 
‘the question that has provided stimulus for the present study. 
peripheral training improve performance in mental skills?— 
specifically, can this type of training use short-exposure 
ods to advantage? 

Tt can be stated with considerable assurance that performance 
reeptual tests of various kinds can be improved by practice 
e reproduction of briefly-exposed stimuli. But does this im- 
ment in laboratory tasks transfer to skills involved in scho- 
tic and other more utilitarian performances? 

ain the answer is seemingly affirmative; but when only those 
ies are considered that include trained control groups—i.e., 
trained by one or more alternative methods—it becomes 
ent that relatively little evidence is available. 

enbach, working with normal (2) and later with feeble- 


"Adapted from a thesis submitted to the Graduate School of the Univer- 
‘Utah and to the Department of Educational Psychology in partial 


ment of the requirements for the Ph. D. degree. The study was sup- 
i ‘ommittee and 
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minded (3) school children found a general increase in scholastic 
achievement as compared with a traditionally trained group; 
Renshaw (8, 10), and Melcer and Brown (6) confirmed him. Ren- 
shaw (9) also reported a rather startling superiority for tachisto- 
scopic training in reading at the college level. Brown (1) used the 
instrument to advantage in teaching spelling to seventh-graders. 
—Minger’s (1 i) college students in beginning typing and the Navy’s 
(Neville, 7) aircraft recognition trainees performed better when 
trained by tachistoscopic methods. ; 

p other hand, Weber (13), Sutherland (12), and Freeburne 

t up training programs in reading for college students in such 

‘way that the gains of the tachistoscopically-trained groups could ` 
be compared with those of groups trained by alternative methods, 
and although they found large gains being scored by their subjects, 
these gains were shared equally by all groups. Hoyt, et al. (5) 
tried the same thing in a college art course, and with the same 
result. 

Taken as a whole, then, the evidence indicates a considerable 
value for short-exposure methods; but the evidence is far from con- 
clusive, for there are important experiments on record in which the 
results are definitely negative. These methods have failed to estab- 
lish decisively their superiority over the more traditional tech- 
niques with which they must in the last analysis compete for 
preéminence as clinical and classroom procedures. 


AN EXPERIMENT IN ARITHMETIC TRAINING 


Computational skill, defined as the ability to respond consist- 
ently and correctly to the presentation of certain number combi- 
nations, was selected to be the focus of the investigation reported 
in this paper. An experiment was conducted in the fourth grade of 
a Salt Lake City, Utah, public school (first experiment) and then 
repeated in another school in another part of the city (second ex- 
periment). The primary purpose was the testing of two hypotheses: 
(1) that tachistoscopic training is effective in the teaching of com- 
putational skills, and (2) that taehistoscopic training is more effec- 
tive than a more conventional, paper-and-pencil approach. (Other 
problems were considered along the way, and will be discussed in 
the section on conclusions, but they were not the central objec- 
tives.) 


ə 
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e experimental groups and a control group were equated in 
of intelligence (mean §1?, approximately one hundred), 
in school (fourth), sex, visual acuity, and number of training 
present. Inasmuch as the classes had already been organized 
the experiment began, it was necessary to do this equating 
groups by casting out cases after the experiment had been com- 
. N for each of the equated groups in the firs(experim 
as ten (five girls and five boys); while in the second experiment 
re were five girls and nine boys, or fourteen in each group. In 
first experiment Group IV, the control group (see below, up 
was eliminated as a result of a procedural error by the teatiagr 
e; in the second, all four groups were studied. In both e; 
ents, the first half of the training was supervised by the 
classroom teacher (the same teacher for all groups), while 
the latter half, the experimenter was in charge. Conditions of 
ining were as follows: 
oup I: a paper-and-pencil procedure in which the practice 
were like the pages in a workbook. 
oup IT: an adaptation of the tachistoscopic method developed 
illey and Billington and published in the Keystone teaching 
ual. It consisted primarily of reproducing (on paper) complete 
mber combinations (both sides of the equation sign) when these 
flashed onto a screen. 
Group III: developed by the writer for these experiments; it also 
de use of the tachistoscope, but in a different way. Instead of 
hing the entire combination onto the screen in a single exposure, 
teacher flashed first the problem and then, after a short pause, 
‘answer. In the 3-sec. interim between ‘problem flash’ and ‘an- 
flash’, the subject attempted to recall the correct answer and 


Toup IV: planned as a control in which no special training 

uld be given on the set of combinations specified for training of 

the other groups. This control group merely continued with its 

classroom work, encountering the experimental combina- 

lions only as they happened to occur in textbook assignments and 
With no special attention being directed toward them. 

The amount of time spent in arithmetic classes per day was the 


The SI (sigma index) is a standard score in & distribution of Salt Lake 
y children’s scores with a mean of 100 and a standard deviation of 20. © 
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same for all groups. The amount of time spent per period in work- 
ing on the experimental combinations* was equal for all excepting 
the control group, where it was not measurable but was known to 
be very small. 
The training series comprised twenty multiplication and twenty 
division combinations selected primarily from the course of study 
for the ne% higher grade (fifth). These were typed onto trans- 
parent slides and projected in .11 sec. exposures‘ by a Keystone 
tachistoscope—an overhead projector with a diaphragm shutter 
ent. 
ent of arithmetic gains made during training was done 
means of a battery of arithmetic tests given before and after 
training, which had been constructed especially for the purpose 
and which presented the trained combinations in each of four ways: 
(1) horizontal, original order—exactly as presented in training; 
(2) vertical, original order; (3) horizontal, reversed order; and (4) 
vertical, reversed order. The effect of training on perceptual 
ability was tested by asking subjects also before and after training 
to reproduce on paper numbers of varying length (one through 
nine digits) when flashed at .03 sec. Slides were made in the same 
way as were those in the training series, and exposures were made 
by the same equipment. : 
The data were analyzed into five categories: Perception;® Hori- 
zontal, Original Order;? Multiplication; Division;? and Arithmetic, 
* Total!®; these were converted into difference scores (‘after training’ 
minus ‘before training’) and subjected to analysis of variance in all 
categories, 


* The length of practice periods varied, but they averaged approximately 
ten minutes for the two experiments. In the first experiment there were 
fourteen daily practice sessions; in the second there were twelve. 

t Calibrated ‘1/25 sec.” on the face of the Flashmeter. 

7 9 

* For example: (1) 9X7= ; (2) X9; (3)7x9= ; (4) XZ; or 0) 
63+ 9= ; (2) 9/63; (3) 63 +7 = ; (4) 7/63 

© As measured by the test described in the paragraph above. 

? Including both multiplication and division. 

8 Including both horizontal and vertical, both original and reversed. 

* Including both horizontal and vertical, both original and reversed. 

10 Including multiplication and division, horizontal and vertical, an 
original and reversed. 
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* 
DIVERGENT RESULTS FROM THE TWO EXPERIMENTS 
ts from the two experiments can be characterized as con- 
t (in basic agreement) or divergent (in disagreement). The 
are important insofar as they throw doubt on the former. ` 
re the data were converted into difference scores, three of 
raining measures were singled out for examinatjjon of their 
tions; these three (the total score from all the arithmetit™ 
the perception test score, and the intelligence test score) - 
re selected because of the theoretical significance that attaches 
tions among them. 
the basis of the theory advanced by proponents of taghist 
‘training, there should be some correlation between per- 
ce in any intellective function and the perceptual skill 
h, according to the peripheral theory, underlies it. For exam- 
arithmetic test scores and perception test scores should be 
ively correlated, as should intelligence test scores and arith- 
test scores. Data from the two experiments do not agree on 
4 3 point: the first experiment showed no correlation, the second 
duced coefficients of r = .30 and r = .25, respectively, for the 
correlations. Both are beyond the .05 level of confidence. 
the total N in the first experiment was only thirty, while that 
he second was fifty-six, differences in significance levels may be 
in part to the difference in N. 
When the data were converted into difference scores, a glaring . 
pancy appeared between the two experiments: the first one 
d no significant F-ratios in the analysis of variance; while 
e second, with the single exception of the perception analysis, 
the F’s were significant. This, too, could be the result of a 
N in the second experiment, though another factor that 
btedly influenced the size of the F-ratios was the presence 
mtrol group in the second experiment. In every analysis, the 
differences were between this one group and all the others. 
no control group in the analysis, the F-tests in the first ex- 
iment were handicapped not only by a smaller N, but by loss 
the greatest contributor to ‘between-groups’ variance. 


CONVERGENT RESULTS FROM THE TWO EXPERIMENTS 


might be fruitful to accept tentatively the above explanation 
lack of significance of ‘between-groups’ differences in gain © 


© 
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scores obtained in the first experiment, and to compare the direc- 
tion of differences obtained in the two experiments wherever these 
are shown to be significant in the second experiment. 

The second experiment produced a significant difference in favor 
of Group I over Group II and of Group I over Group III in every 
analysis but, the one on perception; differences in the first experi- 

<==wnt were also in this same direction. No significant differences 
appeared between Group II and Group III in either experiment. 

In the second experiment Groups II and III were significantly 

zr to Group IV in every analysis but the one on perception. 


Every reason to believe that such a group would have gained even 
less during the training period than did the control group actually 
used in the second experiment. The former would have had no ex- 
perience whatever with the forty experimental combinations"; 
while the latter did encounter some of them in a haphazard manner 
during the time that the experimental groups were being trained. 
Thus, in every case where the second experiment produced a 
significant difference in a given direction, a difference in the same 
direction was found to have emerged also from the first experiment. 


DISCUSSION 

The statement was made as an introduction to this paper that 
its major objectives would be the testing of two hypotheses: (1) 
that tachistoscopic training methods are effective in the teaching 
of number skills, and (2) that they are more effective than a con- 
ventional workbook method. 

Actually, there was implicit in the design of the experiments & 
more precise statement of the second hypothesis; the development 
of an original tachistoscopie method for comparison with the work- 
book method, but also with the Willey-Billington method, implied 
the further hypothesis that this new tachistoscopic technique 
would prove superior to the established one. This hypothesis was 
based on a conviction that a period of striving for recall—the 
‘challenge period’, the writer calls it—was important’ in learning 
and was missing from the procedure described by Willey and 
Billington. Their method, on the other hand, was expected to prove 
superior to the workbook approach because it was designed to rid 


11 The course of study did not call for these particular items until later 
in the year, 


p \ 


ere was no control group in the first experiment, but there is , 
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s of their “uneconomical and disorganized habits of calcula- 
in favor of more efficient direct responses in the form of a 
tinuous emergence of precise, economical, mature patterns of 
ng.” (14) Hypothesis 2, then, really involves a ranking of the 
experimental methods of training with reference to their 
ness in producing improvement in number skillg. 


bout the relationship of computational skills to perception: 
‘subjects who are superior to their fellows in perceptual ability 


2 extent dependent on the former; (2) improvement in. perce 
ability is accompanied by an increased effectiveness in com- 
ition; and (3) the tachistoscopic training used in these experi- 


he entire hypothetical structure, together with pertinent re- 
from both experiments, is summarized in the following 


a PERCEPTION 


1) “Subjects who are superior in perceptual ability are also 
üperior in number skills.” The validity of this hypothesis can be 
her confirmed nor rejected on the basis of correlation coeffi- 
obtained before training between perception test scores and 
etic test scores: in the first experiment, the correlation was 
nificant by inspection of the scatterplot, while the second pro- 
ed a correlation of .30 which is significant beyond the .05 level 
onfidence. The insignificant r from the first experiment may be 
in some measure to the smallness of the N. 

“Improvement of perceptual ability results in improvement 
umber skills.” This hypothesis could not be tested, primarily 
se of the results reported under hypothesis 3, below. 

The tachistoscopic training used in these experiments is effec- 
in developing perceptual ability of the type tested.” This 
othesis must be rejected on the basis of the analysis of variance 
Perception test difference score: F-ratios in both experiments 
insignificant, and since the lowest difference score was m each 
actually a loss of proficiency, it can be safely concluded that 
significant gain was made by any of the seven groups tested. 
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Iso implicit in the design of the experiment is a set of hypotki= 


g 


@ 


466 The Journal of Educational Psychology 


NUMBER SKILLS 


1) “Tachistoscopic training is effective in the teaching of com- 
putational skills.” This hypothesis can clearly be accepted. In the 
_ first experiment, which lacked a control group, all of the experi- 
mental groups made significant gains in arithmetic test scores of 
the four t@pes analyzed; in the second experiment, there was in 
ery analysis a significant difference between the gain made by 
the control group and the smallest of the gains made by the ex- 
perimentals. $ 

‘The three experimental methods can be ranked according to 
eir xelative effectiveness in the following order: III, II, I.” This: 
hypothesis must be rejected. In the first experiment, the analysis 
of variance did not disclose significant differences between the 
groups; but the N was small, and all differences were in the same 
direction as the significant differences obtained with a larger N in 
the second experiment. Both experiments ranked the methods as 
follows: Group I (workbook) first, Groups II and III (tachisto- 
scopic) second; Group IV (control) was last in every analysis. 

Caution must attend the extension of these interpretations to 
situations other than the ones described here. Previous experi- 
ments with subjects at the primary-school level, for example, have 
indicated that the tachistoscope can be of value to teachers in a 
number of different areas, including arithmetic. 

Tt should be noted also that a good many interaction effects are 
possible which could not be examined in the present study. Controls 
were attempted of all the variables that seemed at the outset most 
likely to influence the results; but on the one hand it is impossible 
to control every factor that could be included, and on the other 
hand variance contributed by these factors might prove interest 
ing in its own right. Had conditions permitted, a multivariate 
design would have been more rewarding. we 

The results might be challenged from another point of view: it 
could be objected that the standardization that was necessary for 
a clear differentiation of methods had the effect of rendering them 
all stilted and artificial. The criticism is well-taken; it is undoubt- 
edly true that each of the three experimental methods suffered 
to some extent from the limitations imposed by experimental con- 
trols. On the other hand, it is also true that the methods were 
carefully planned to get the most out of each within these limita-. 
‘sap p oa is no apparent reason why the enrichment that 
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possible in a non-experimental situation should affect 
d any more than another. 


SUMMARY AND CONCLUSIONS 


is investigators of tachistoscopic training have not come 
greement regarding its effectiveness with learners of vary- 
and backgrounds working with many differen? materials. 
apparently specific to particular constellations of these 
‘and conclusions must be limited to the one constellation 
characterizes a particular investigation. The present qfudy 
ned with the effects of fourteen 8-min. training sessi 
ade, public-school children working only with aritiimetil 
tions (multiplication and division). The writer concludes 
1 this situation tachistoscopic training is effective in the 
g of number skills but that it is not more effective than an 
y workbook method of practice—in fact, the two tachisto- 
methods used proved to be actually less effective than the 
ook method. Tests of rapid perception showed no gain in 
the groups over the period of training. 
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ecent researches in the field of behavior have high-lighted the 

pts of empathy and personal values. These areas appear to 
promise for students of behavior. The ability to empathize 
mportant in social intercourse. It appears also to be a fruitful 
ce of understanding individual differences in certain areas of 


tion in such fields as counseling where an understanding of condi- 
tions of living desired by the individual is important. In these areas, 
es form a triad with aptitude and interests. 
e purpose of this research has been to explore somewhat the 
tionship between empathy and personal values. It is felt that 
gh continued research in these areas human behavior may 
become less and less an enigma. The postulates of George H. Mead 
have been very fruitful in throwing light on the functioning 
structuring of the mind and the self and their relation to 
ety. The ‘generalized other’ of which Mead speaks has given 
my hints in the direction of a fuller understanding of the nature 
social intercourse. Particularly when Cottrell’s rôle theory (1) 
‘added, one gains much insight in taking the rôle of the ‘other’. 
key to unlock many dilemmas in 


e use of the term empathy. In the use to which she has put it, 
is no reference to the sensations of inanimate objects being 
ized but rather reference is made to the internalizing of @ 


469 Q 
G 


T a 


470 The Journal of Educational Psychology 


the affect and cognition of the ‘other’. Dymond defines empathy as 
follows: “Empathy denotes the imaginative transposing of oneself 
into the thinking, feeling and acting of another and so structuring 
the world as he does.” 
Woodruff (6), using the value concept as a basis, has developed 
a theory of motivation which holds much promise. He defines 
<=nersonal values as generalized statements of conditions of living 
which the individual believes have an effect on his well-being— 


positive—negative—neutral. The measurement of values under his , 


ion has added to the fields where psychometrics attempts to 


t and predict. Egbert ($) has refined the measurement of | 


alues to a point where objectivity is possible on the part of the 
psychometrist and his scale has sufficient statistical validity and 
reliability for survey purposes. 

Dymond describes her Rating Test as follows: “The test was 
made up of four parts, each containing the same six items. In the 
first part the individual was asked to rate himself, on a five-point 
scale, on each of six characteristics. In the second part he was asked 
to rate the other individual as he believed this other would rate 
himself. In the third part the individual rates the other person as 
he thinks the other person would rate himself. In the fourth he 
must rate himself as he thinks the other would rate him. In other 
words, if two individuals A and B are being tested for their empathy 
with each other, the procedure would be as follows: 

“A. Part 1: A rates himself, (A). Part 2. A rates B as he (A) sees 
him. Part 3. A rates B as he thinks B would rate himself. Part 4. 
A rates himself (A) as he thinks B would rate him. 

“B, Part 1. B rates himself, (B). Part 2. B rates A as he (B) sees 
him. Part 3. B rates A as he thinks A would rate himself. Part 4. 
B rates himself (B) as he thinks A would rate him. 

“Therefore a measure of A’s empathic ability can be derived by 
calculating how closely his predictions of B’s rating, (A 3 and A 4), 
correspond with B’s actual ratings (B1 and B2). Similarly a meas- 
ure of B’s empathy with A can be obtained by calculating how 
closely his predictions of A’s ratings, (B3 and B4), correspond to 
A’s actual ratings (A1 and A2). 

“The six traits which were used as the items in all four parts of 
the test were: 1. self-confidence, 2. superior-inferior, 3. selfish- 
unselfish, 4. friendly-unfriendly, 5. leader-follower, 6. sense of 
humor, 
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ugh the usual objections to such trait-ratings were recog- 
is procedure was followed none-the-less because the rat- 
not being used to determine the personality of the sub- 
to determine how accurate the others were to their 
on of this. The test was designed to answer the question 
can the subject transpose himself into the thinking, feel- 
acting of the others. If he can do this he shoul be able fo 
how the others will behave in certain defined situations. 
tuation chosen to test this ability was the subject’s ability 
dict how others will rate themselves and how they willwate 
these six traits. (A high score indicated low empal 

‘and a low score the inverse.)” G 

the purpose of this study an adapted form of Dymond’s 
devised. This form may be used with an I.B.M. answer 


) 4g “A Study of Choices Form VII” gives fifteen values for 
sonal value pattern. He lists and describes the values as 


(3). ġ 
hetic Appreciation—Appreciation of beauty and quality of 


Comfort and Relaxation—Primarily physical comfort with 
me reference to ‘mental comfort’. f: 
Excitement—Experiences which are unusual and which give a 


Freedom—Lack of physical, mental or moral restraint from 
xternal source. 
Friendship—Personal attachments to people outside the 
Ue. 
Family life—The husband, wife and children with the inter- 
onships that exist among them. 

tellectual Activity—Mental activity as an end in itself. 
Personal Improvement—Improvement of appearance 
y with the social graces. 
Power and Control—Control over associates and followers. 
Recognition—Prestige as such, apart, in-so-far as possible, 
Religion—Formalized religion with particular emphasis on 

a, Supreme Being and a life after death. : : 
Privacy—Being alone and having personal belongings which 
e kept from other people. 

Security—Financial and physical security. 
G i 
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14) Society Life—Mingling with the social upper class and being 
active, in clubs and night life. 

15) Social Service—Doing things for other people: The Golden 
Rule. 

The device is based on a ranking method. Egbert states there 
are thirty groups of five statements of values. The problem is to 

the atements in each group according to their importance 
or the happiness of the respondent. Each value appears ten times 
in the test and each value is in the same group with each other 
value at least twice and not more than four times. i 
e test correlates well with Woodruff’s Study of Choices. In- 
orma interview has also indicated its validity. Reliability coeffi- 
cients based on the Froelich index of test reliability have a median 
coefficient of 0.690. 

Eighty college sophomores and juniors, students of elementary 
educational psychology, were selected for subjects. Egbert’s value 
test was administered to derive a value pattern for each subject. 
A questionnaire was filled out by each subject in which he ranked 
the other subjects from ‘know casually in class’ to ‘close friends’. 
Four subjects were chosen for a grouping for the administration of 
the empathy test. The subjects were selected to be in the same 
group only if their degree of friendship was more than ‘casually in 
class’ and less than ‘close friends’. 

Dymond’s empathy test was administered to the population and 
scored according to the deviation score. This gives each subject an 
‘empathic ability’ score and an ‘ability to be empathized with’ 
score. 

Table 1 is concerned with the twenty subjects from the popula- 
tion who were lowest and the twenty subjects who were highest 
for each value pattern concerned. Their empathic scores are also 
shown in this table. 

A difference significant only at the .10 level was found between 
the low social service and the high social service groups. ‘This 
difference favors the subjects having the greatest empathic ability. 
The group nature and social intercourse supposition of socia 
service would be in favor of those having the greatest empathic 
ability. 

The value tests were tabulated for deviation; that is, a score was 
derived for each individual depending upon his value deviations 
from the theoretical mean of 30 for each value. Either a positive 0° 
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—EMPATHIC ABILITY FOR Grours HrGH AnD Low ror Eacn VALUE 


Total | High | Low | Mead | e |Tgvelof 
23.72 | 25.60 | 23.40 2.20 
23.72 | 25.30 | 22.95 2.35 
23.72 | 23.35 | 22.55 | 0.80 | @ 
23.72 | 24.35 | 23.80 0.55 >a 
23.72 | 24.10 | 24.65 0.55 
23.72 | 21.10 | 24.50 2.90 
Intellectual act. 23.72 | 22.90 | 23.95 | 1.05 
8) Personal improv. | 23.72 | 23.30 | 23.30 | 0.00 
ower and control | 23.72 | 24.75 | 23.90 0.85 o 
23.72 | 23.10 | 24.50 1.40 
23.72 | 23.75 | 23.75 0.00 
23.72 | 23.70 | 24.20 0.50 
23.72 | 23.55 | 25.20 1.65 
23.72 | 24.00 | 25.20 | 1.20 i 
Social service 23.72 | 21.45 | 24.55 3.10 1,89 10 
Deviation score 23.72 | 23.70 | 25.85 | 2.15 
s N-80 | N-20 | N-20 * 


ive deviation was added as it was felt the direction of devia- 
| was not the significant factor. The means for the high twenty 
the low twenty in empathic ability were tabulated and then 
for statistical significance. A ‘t’ score significant at the .05 
l was yielded. This indicates a statistically significant greater 
ount of deviation was found to exist for those subjects with 


might appear in the value pattern of a less mature person. He 
efore may need less effort to determine his own status and more 
can be expended to clarify relationships. : 

Table 2 is the companion of the preceding table. It indicates the 
n ability to be empathized with for the high twenty and low 
enty for each value named. Significant differences were found to 
or family life at the .02 level for the higher group. The value of 
ity was found significant at the .05 level. Again, the high 
ips had the higher mean. Social service value yielded a ‘t’ of 
which was significant at the .01 level. The high empathic 
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TABLE 2—“ABILITY TO BE EMPATHIZED WITH” FOR GRouPs HIGH AND 
Low ror Each VALUE 


Dif between 
Total | High | Low | High and t level 
1) Esthetic apprec. 23.83 | 25.70 | 22.55 3.15 
2) Comfo3) 23.83 | 25.95 | 23.60 2.35 
= 38) Excitement 23.83 | 25.20 | 22.70 2.50 
4) Freedom 23.83 | 25.60 | 24.05 1.55 
5) Friendship 23.83 | 24.30 | 25.05 0.75 
6) Family life 23.83 | 22.20 | 27.40 5.20 2.77 02 
tellectual act. 23.83 | 23.70 | 25.90 2.20 
) Personal improv. 23.83 | 24.05 | 23.35 0.70 
9) Power and control | 23.83 | 23.35 | 23.95 0.60 
10) Recognition 23.83 | 24.80 | 23.35 1.45 
11) Religion 23.83 | 22.40 | 26.30 3.90 1.99 -10 
12) Privacy 23.83 | 25.40 | 23.10 2.30 
13) Security 23.83 | 21.35 | 25.60 4.25 2.51 .05 
14) Society life 23.83 | 26.50 | 23.30 3.20 
15) Social service 23.83 | 21.75 | 26.05 | 4.30 |2.54| .02 
16) Deviation score 23.83 | 23.85 | 28.45 4.60 2.86 01 
N-80 | N-20 | N-20 
ability group had the highest mean. Religion was found to be sig- 


nificant at only the .10 level for the high group. These values are 
all important in group living. Those individuals who function well 
in social relations appear to also lead in values of a social nature. 

The table also indicates the significant differences in the devia- 
tion scores. The high group had a greater deviation score, signifi- 
cant at the .01 level, reflecting what may be thought of as greater 


maturity. 


TABLE 3.—Vauun PATTERNS ror Ten SUBJECTS WITH Waom OTHERS 
Empatuize Best 


Values Values 
1) Family Life 41.40 || 9) Power and Control 28.40 
2) Friendship 38.30 || 10) Recognition 27.60 
3) Security 37.50 || 11) Intellectual Activity 26.80 
4) Personal Improvement | 34.60 || 12) Society Life Be 
5) Social Service 34,40 | 13) Excitement aon 
6) Freedom 33.00 | 14) Privacy D 
7) Religion 33.00 || 15) Esthetic Appreciation 19. x 
8) Comfort 28.90 || 16) Deviation Score 100.7 
Te ER 
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mposite value pattern for the ten subjects with whom 

Ts empathize best is shown in Table 3. It is noted that the val- 

that rank highest are, in general, those which are heavily 
by social intercourse. 

correlation between empathic ability and the ability to be 
ized with was found to be +0.335. Research by Dymond’ . 
ated the finding of a correlation of +0.60 between ema <a 
ic ability and ability to be empathized with. 


SUMMARY AND CONCLUSIONS 


ond’s Rating Test for measuring empathic ability 

¥s Study of Choices, Form VII were administered to dighty 
students of educational psychology. They were grouped in 
‘ways to reflect certain value pattern combinations. 

g were tabulated statistically to determine degree and 
‘relationship existing between personal values and empathy. 
ts indicate that the subjects with high empathic ability— 

ly the ‘ability to be empathized with’—tend to have the 
values in the areas where group interaction and social in- 

are major factors. Where the high values are less depend- 
on group life and can be satisfied by individual action, 
athy scores appear to be lower. 
onclusions that certain other factors are operating in the rela- 
ip appear to be warranted. They are: 
Empathy appears to be related to the amount of dispersion 
n a given pattern; i.e., if the value pattern is flat, or has few 
empathy scores are lower. 
impathy seems to be positively related to such value factors 
life and social service. For the values comfort, power and 
, and society life an inverse relationship is indicated. 
correlation coefficient of 0.335 was found between empathic 
and the ‘ability to be empathized with’. 
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COLLEGE GRADES AND SELF-ESTIMATES 
OF INTELLIGENCE 


ORVILLE G. BRIM JR. 


University of Wisconsin Ca 


‘This paper reports a study of the relationship between individual cig 
elf-estimates of intellectual rank and college academic perform- 
as measured by individual grade-point average. The study 
tinues some of the exploratory work on the influence of ni 
tive factors upon high levels of academic and occupdtior 
nent which began in another research program.’ 
major proposition of the study is that the grade-level at- 
by an individual varies directly with his own estimate of his 
nce. The relevant data are those showing the relationship 
the subject’s actual grade-point average and self-estimate 
elligence. 
is further proposed that this relationship results in part from 
dency of the grade-point average sought by an individual, as 
as the strength of his motivation to achieve it, to vary directly 
his own estimate of his intellectual rank. Thus two types of 
relevant data here are the relationship of responses to questions 
about grade aspiration and about motivation to self-estimates of 
intelligence. 


METHOD 


“Questionnaires were administered to one hundred and three 
tudents in an advanced social psychology class (no freshmen, 
eleven sophomores) at the University of Wisconsin. 

"Data on self-estimated intelligence were collected in the follow- 
ing way. Each respondent was asked to assume that he was among 
one hundred students taken at random from his complete class at 

the University, and that this group of one hundred was then given 

‘series of intelligence tests and ranked from 1 (high) to 100 (low) 
‘on general intelligence on the basis of the test results. The respond- 


C "The writer was formerly a staff member of the project, “Cultural Fac- 
tors in Talent Development,” under the general direction of Fred L. Strodt- 
ck of Yale University. © 
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ent was then asked to estimate what his position or rank would be 
among these one hundred students. 

Respondents were given the choice of signing their names or of 
leaving the questionnaire anonymous. Of the one hundred and 
three respondents, seventeen did not identify themselves. There 
were no differences between signers and non-signers on the charac- 
th teristics @ sex and of class in the University. The non-signers’ 
self-estimates of intelligence were higher than those of the signers, 
the mean of the estimates being the 31st rank in 100 for the former 
ang the 39th rank for the latter. A ¢ test, however, (2, Ch. 12), 
ows the probability of this difference to exceed the .05 level, and 
therefore in subsequent analysis the two groups are considered 


“together where appropriate. 


For the respondents who identified themselves it was possible to 
get both the cumulative grade-point average and a measure of 
their intelligence from University records.? The intelligence meas- 
ure was the American Council Psychological Examination (1948 
edition). The A.C.E. test norms used in computing the percentiles 
were those for the entering class of the individual in question. 


- 


RESULTS 


1) Relationship of Actual Grade-point Average to Estimated Intel- 
lectual Rank—In our analysis it is necessary to separate the in- 
fluence upon grade-point average of both actual and estimated 
intelligence. The correlation? between these two intelligence meas- 
ures was .43, The subjects in general overestimated their intellec- 
tual rank. The mean estimated rank for identified subjects, 98 
noted above, was the 39th. Their mean rank on the A.C.E. was the 
52nd percentile, which when reversed in direction to be comparable 
to our data becomes the 48th. Hence the subjects on the average 
amos their rank to be nine percentiles higher than the test 

ts. 

The relationship of both actual and estimated intellectual rank 
to grade-point average is shown in simple form in Table 1. The cell 
values were generated by cutting the rank order of both actual in- 
telligence and estimates at the median, so as to most equalize the 


2 The writer is indebted to Dr. Joseph L. Lins, Director of Student Per- 
sonnel and Studies at Wisconsin, for his assistance in this task. 

3 Pearsonian correlation coefficients are used throughout. See McNemar 
(2, Ch. 6.) 
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1— MEAN GRADE-POINT AVERAGE BY A.C.E. AND ESTIMATED 
? INTELLECTUAL RANK* 


ms of letter grades, 1 equals C, 2 equals B, 3 equals A. 


ion, and then computing the mean of the grade-point 
of the individuals in the cells. 
‘erences between means show the influence of actual in- 
which is to be expected. But the differences also show 
with actual intelligence roughly constant, those persons who 


eir actual and estimated intelligence are .31 and .32, re- 
, which are significant at the .01 level. 
the linear effects of actual intelligence held constant, the 
| correlation (3, Ch. 9) between grade-point average and 
intelligence is .20. The partial correlation between actual 
ence and grades with estimates held constant is also .20. A 
22 would be necessary to reach the .05 level of significance. 
terpretation is that both variables, actual and estimated 
rank, are making a small but independent, contribution 
ferences in grade-point average, with individuals presumably 
influenced predominantly by one or the other, 
‘multiple correlation of both measures of intelligence with 
le-point average was .38, indicating that either of the simple 
ions is almost as good a predictive measure as are both 
taken together. None of the correlations, however, is 
enough to be of much value in prediction.‘ 
elationship of Desired Grade-point Average to Estimated Intel- 
Rank.—Our predictions were that a high estimate of intel- 
rank would be associated with a high grade level sought by 
ion to the correlation data, a simple Guttman coefficient of 
y was computed for actual and estimated rank combined. In 


se C equalled .39. ¢ 
y ® 
& 


Estimated Rank 
Low High 
A.C.E. Rank AQQE. Rank 
Low High Low High ® O 
1.54 | 1.67 | 1.76 | 1.97 
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the student, and a strong motivation to achieve this level. Such 
predictions would follow from the theoretical work of Horney (1), 
Murphy (4), Sherif and Cantril (5) and others. 

An individual’s self-image is composed of segments, e.g., one 
thinks of himself as big, strong, and handsome. Where segments of 
the self-i are valued by the individual, ego-motives arise. 

» «These a are directed to goals which, if achieved, the indi- 
vidual can interpret as evidence that he is in fact what he considers 
himself to be. 

e segment of self-image involved in this study is one’s con- 
ption of himself as ‘bright’, ‘average’, ‘dumb’, and the like. Most. 
normal individuals in our culture positively value such conceptions 
as brightness; i.e. conceptions of being highly intelligent. Indi- 
viduals who have such self-conceptions of high intelligence are 
motivated to protect this conception, to act in such a way that 
corroborating evidence results. 

Among American college students, the achievement of high 
grades is customarily held to be evidence of high intelligence. 
Individuals among this group who conceive themselves as ranking 
high in intelligence would therefore normally seek high grades as 
evidence for this conception. 

However, if we hold that grades are taken as evidence of one’s 
intellectual abilities, then we must recognize at this point the 
criticism that the correlations reported above may simply reflect 
the fact that the self-estimates are based on grades, and do not 
stand in any causal relationship to them at all. Our answer is that 
the relationship between college grades and self-estimates of in- 
telligence is probably a fluid one in which a gradual accommodation 
is reached between the self-conception and the realities represented 
by grades. The remainder of the paper presents data which indi- 
cate that self-estimates do play a causal rôle in determining grades, 
and that therefore the correlations reported above represent 17 
part the adjustment of grades to self-conceptions, and not solely 
the reverse.® 


5 An experimental design giving an adequate answer to this question 
would be one in which randomly selected groups of entering college students 
were given information on their intellectual rank, with one group being 
placed above their actual test scores, another below, and so on. The ae 
could then be followed through college, and their achieved grades sompa 
with the grades predicted on the basis of customary psychometric tech 
niques. @ 
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BLE 2.—CHANGE IN GRADE ASPIRATION WITH AssuMED RISE IN 
INTELLECTUAL RANK 


Response Number of Subjects 


Try for better grades 
Try for less 

Not change at all 
No answer 


wo8od 


tendency of aspiration level concerning grades to vary with 
belief about his intellectual rank was explored by means f 
hetical situations. Each subject was asked to assume that 
ound out that he was much more intelligent than he thought, 
true rank was much higher, and then was asked to indicate 
ether he would ‘try for better grades’ than he was trying for 
try for less than now, or ‘not change feelings at all about 
ng for grades’. The distribution of responses support the pre- 
liction. These are shown in Table 2. 
The same information was collected again, but with each subject 
d to assume that he had discovered that actually his intellec- 
rank was much lower than he thought. The distribution of 
nses is given in Table 3. 
he writer can offer no systematic explanation for the responses 
e eighteen subjects who would try for better grades as a result 
finding they were much less intelligent than they thought. 
regard to the ‘non-changers’ in both situations, it seemed 
able that these were subjects who fall toward extremes in 
imates. Those who already have high self-estimates would 
ly to raise their aspirations on finding they are even more 
igent. Conversely, those subjects with low self-estimates 


d be unlikely to lower their desires after finding out they are 
lower than they thought. 


TABLE 3.—CHANGE IN GRADE ASPIRATION WITH ASSUMED FALLIN 
INTELLECTUAL RANK 


Response Number of Subjects 
Try for better grades 18 
Try for less 50 
Not change at all 27 
No answer 8 G 
—¢ 
Q 
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This possible explanation of lack of change in aspiration was 
tested by splitting the subjects into high, middle, and low groups 
in regard to their own estimates of intelligence, cutting at points 
which most equalized the numbers in the three groups. There was, 
however, no significant difference between the three groups in the 
number of non-changers, thus indicating that the explanation lies 


æ elsewhere. 


3) Relationship of Strength of Motivation to Estimated Intellectual 


Rank.—As a measure of strength of motivation each subject was , — 


to indicate what his feelings were when he failed to achieve 

grade he had set as a goal in any particular instance; i.e., if he 

ailed’to do as well as he was trying to do in regard to grades. The 

indication of concern was made along a nine-point scale ranging 
from ‘indifferent? (0), to ‘extremely upset’ (9). 

Responses to this item were placed into high and low categories 
of concern by cutting at the point (between 6 and 7) which made 
most equal the numbers in each group. The number of responses 
in each category by subjects in the high, middle, and low groups in 
regard to self-estimated intellectual rank is given in Table 4. A chi 
square test indicates the difference in distribution of responses to 
be significant, with P <.05. 

The results support our contention that subjects with high self- 
estimates would show greater concern. However, they show in 
addition that many of the subjects with lowest estimates are also 
highly concerned. One explanation of this may be that subjects in 
this group are concerned for quite practical reasons. The scholastic 
average of this group is low. They are concerned not over threats 
to the ego-picture, as are the high subjects, but rather over the 


TABLE 4.—DEGREE oF CONCERN OVER FAILURE BY ESTIMATED 
INTELLECTUAL Ranx* 


Concern 
Rank 
Low High 
High 16 15 
Middle 27 7 
Low 17 14 


* There were.seven ‘no answers’ on the question about amount of con- 
cern, 


+ 
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y of being put on probation or dropped from school for 
keep up their averages. 
relationship between motivation and self-estimates was ex- 
an additional way, in a hypothetical situation. Each sub- 

was asked to indicate how he would feel if he failed to get the _ 
he would try for after finding out that he was much more 
nt than he thought. The same nine-point scaleWor indicat- 
responses was used. hi 
mean concern indicated in response to this hypothetical 
on was six on the scale. This was identical with the mean 
n expressed by the subjects over their actual failures. i 
ng was contrary to the prediction that the mean concern wo 
r in the hypothetical case. 
ntical information was collected in regard to a hypothetical 
overy of much lower intelligence. The mean concern indicated 
s five on the scale. A ¢ test indicates this to be significantly lower 
the concern expressed over actual failures, with P <.001. 
difference is in accord with our prediction. 

was no relationship between original self-estimates of 
‘and changes in degree of concern following the hypothetical 
in rank, This is in accord with the finding on lack of 
in grade aspiration under the same conditions. 


SUMMARY 


~ A questionnaire study was carried out to determine the relation- 

» between self-estimates of intelligence and college grades. A 

but significant correlation of .32 was found. When actual in- 

ce (A.C.E. percentile) is controlled, the correlation drops 

.20, which is slightly below the .05 level of significance. 

en the subjects were asked to assume that they discovered 

intellectual rank was much higher and much lower than they 

estimated, their reported aspiration level for grades showed a 
nding increase and decrease. 

nificantly more of the subjects in both high and low thirds of 

self-estimated intellectual ranking reported a high degree of 
over failure to achieve the grades they desired. 

en subjects were asked to estimate their concern over failure 

assuming their intellectual rank to be much higher and much 

than they had estimated, there was no difference from exist- 


Q 
© 


& 


484 The Journal of Educational Psychology 


ing concern over failures in the first case, but in the second it 
dropped significantly lower. 
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_ SPELLING ABILITY AND VOCABULARY 
LEVEL OF ONE HUNDRED COLLEGE 
FRESHMEN 


FRANK LAYCOCK 
© 


nt Professor of Education, University of California, Riverside, Calf. © 


High-school and college English teachers chronically deplore the 
or spelling they find in student essays and occasionally suggest 
it hampers effective expression. If an idea seems to deman 
in word, the argument goes, but the word is hard to Spell, AX. 
rudent writer will search for a substitute he knows he can spell. 
e chooses a less suitable substitute, clarity loses out to prudence, 
the solution must be to improve spelling. 
‘This reasoning permits three hypotheses: (1) the complexity of 
rose vocabulary correlates positively with spelling ability (so that 
oor spellers restrict their written vocabulary to words they can 
Jl); (2) vocabulary complexity shows no correlation with spelling 
bility; and (3) vocabulary complexity correlates negatively with 
spelling ability. This investigation tested these hypotheses on a 
group of college students. 
5 PROCEDURE 
 Spelling—In a preliminary trial Hudelson’s (3) ‘Seven S list 
ielded thirty-five words suitable for a college-level spelling test. 
n additional, harder words came from an unpublished scale 
Luther C. Gilbert, making a total of fifty words. A large class 
taining nearly one hundred and fifty freshmen took the spelling 
. Introductory directions explained the situation and invited 
coöperation; mimeographed sheets provided space for writing 
Personal data and the fifty words; an unhurried, friendly atmos- 
here dissipated tension. Each of the words in the test was pro- 
lounced clearly, read in a sentence, and pronounced again. The 
whole process for a word was repeated on request; to finish, the 
words alone were repeated in order. 
" Students—In the class tested, one hundred and forty freshmen 
turned in usable tests. Further selection eliminated: immigrants 
handicapped in English; students claiming freshman status whose 
“Tecords showed a year or more of college work; students whose 
A : 485 G 4 
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English placement tests (containing the essays to be used for meas- 
uring vocabulary) were not available; and students who had al- 
ready studied English composition at college. There were one 
hundred and two students left; two more were arbitrarily excluded 


to provide an even one hundred. These one hundred, from a course. 


which annually attracted a large student group and had no re- 
strictions d admission, were representative of freshmen beginning 
eir second semester of college. 
Vocabulary.—To measure written vocabulary, the essays were 
used which are part of the matriculation examination in English. 
entrant must take the test and is likely to try his best, be- 


gause it is widely known that failure means separate non-credit $ 


instruction and a certain popular stigma. Instructions clearly state 
that mechanical errors like spelling are penalized and that wise 
choice of words determines much of the total score. The suggested 
length of five hundred words is a generous sample on which to base 


_ an average. 


A separate tally sheet was prepared for each student, with space 
for pertinent data taken from the essay folder. Horn’s (2) 10,000- 
word list was used to measure complexity in preference to that of 
Thorndike (4) or Ayres (1) because it was based on school work 
and letters as well as published reading material. Horn considered 


_ derivatives separately, and had a system of weighting and credits 


which make the final tabulation as accurate as possible. For each 
word in a student’s essay which was among Horn’s 10,000 most 
common words, a score was tallied to indicate the weighted fre- 
queney assigned to it by Horn. This weighted score represented 
the frequency with which the word was met in tallying. over 
5,000,000 running words. A séparate space was reserved for words 
not in Horn’s list and for proper names (arbitrarily ignored by 
Horn). Words not among Horn’s 10,000 are presumably either 
more difficult or more specialized—and in either case would there- 
fore not be words most commonly used in discussion. However, the 
topics listed as acceptable for essays (for instance, “Juvenile delin- 
quency as a post-war problem”) called for special and perhaps 
difficult terminology. These uncommon words were tallied sepa- 


rately even though no weighted score was available, because leav- 


ing them out would eliminate any indication of either a rich 
vocabulary or the restrictions spelling might impose. 
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RESULTS 


ing.—For each test, the number of words correctly spelled 
converted in the usual way to standard scores having a mean 
50 and a o of 10. The raw scores ranged from 12 to 48 around 
mean of 30.7, with e = 6.8. Thus the range of standard scores 
rom 21 to 75. The raw scores were tested for skewness, and 
to be sufficiently normal in distribution to warrant conve- 
n to standard scores. 
Vocabulary —Analyzing vocabulary was complicated by the 
ry wide range of credit numbers (from 11 to over 700,Q00). 
Nalker’s (5) suggestion helped: “When the frequency piles up 
d of a distribution, use of a unit which is fine enough to brinkX. 
t the form of the distribution in that part of the range where the 
mber of cases is large may produce a very long table if the same 
val is used throughout. In such cases, the width of interval is 
etimes changed from one part of the range to another, Compu- 
ions may be carried out with the gross scores which correspond - 
the midpoints of the various intervals.” 
ecordingly, intervals were arranged, and for each word the 
oint of the interval within which its credit number fell was 
ed. Hach essay thus reduced to a tally of interval midpoints, 
the mean of these midpoints represented the vocabulary level 
the essay. For the one hundred essays, scores ranged from 
706 to 303,891 around a mean of 254,216, with e = 9,895. 
, in an essay given a score of 208,706, the average word was 
valent to one which Horn tallied 208,706 times in examining 
5,000,000 running words. These raw vocabulary scores were 
erted to standard scores for easy comparison with spelling 
ard scores. However, a reversal was necessary. Large spelling 
represent superior ability; large vocabulary scores indicate 
of common words—that is, in this context, a poorer vocabulary. . 
cabulary standard scores were therefore reversed by subtracting 
from 100 (theoretical maximum) before being entered, so. 
a large standard score would represent superior vocabulary. 
“Correlation of spelling and vocabulary scores.—The aim of the in- 
igation was to see how spelling and vocabulary relate to each 
. Correlating the two sets of scores gave a Pearson r = 075 
067, indicating a very nearly chance relationship. This absence 
correlation supports the second hypothesis, and it therefore 


© 


G 


e 


488 The Journal of Educational Psychology 


TABLE 1.—SrELLING Test; Best, MIDDLE, AND Worst STANDARD Scores 
WITH CORRESPONDING VOCABULARY STANDARD SCORES 


Worst Scores Middle Scores Best Scores 
Stud. | Splng. | Vocab. | Stud. | Splng. | Vocab. | Stud. | Sping. | Vocab. 
1 3 43 46 50 54* 91 63 | 60* 
2 36 47 51 49* 92 | 63 | 51 
3 28 65 48 51 52* 93 | 65 | 62* 
4 28 55 49 53 42 94 66 38 
5 28 42 50 53 62 +» 95 66 35 
6 28 53 51 53 59 96 66 45 
7: 30 57 52 54 47 97 66 43 
8 31 50 53 54 44 98 | 68 | 64* 
9 31 37 54 54 58* 99 | 68 | 66* 
10 31 34* 55 54 49* | 100 | 75 | 58 


* Vocabulary scores within five standard score points (= 0.5 ø) of cor- 
responding spelling scores. 


cannot be concluded that poor spelling ability noticeably constricts 
written vocabulary. s 

Further analyses.—In case the over-all correlation concealed re- 
lationships that might be revealed by extreme scores, two com- 
parisons weremade: (1) the best ten, worst ten, and middle ten 
spelling scores were paired with their corresponding vocabulary 
scores; and (2) the reverse comparison placed the best, worst, and 
middle vocabulary scores alongside corresponding spelling scores. 
Tables 1 and 2 show this comparison. The small number of entries 
in each group did not warrant elaborate statistical procedures. 
‘Close relationship’ was arbitrarily assigned to scores within five 
standard score points of each other, i.e., within 0.5 s. Using this 
criterion, there was little discernible relationship: isolating the 
best, worst, and middle scores did not reveal many closely related 
pairs. 

Another possibility suggested itself. Since the directions offered 
many topics on which to write essays, perhaps a comparison among 
students discussing the same topic would show a pattern. Students 
choosing a common theme may discuss it at different vocabulary 
levels. Vocabulary level here could be measured by considering the 
number of words not in Horn’s list, and the quality of vocabulary 
thus indicated compared with spelling ability. Table 3 groups four 
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BLE 2.—VocaBuLary Test: Bust, MIDDLE, AND Worst STANDARD 
SCORES WITH CORRESPONDING SPELLING STANDARD SCORES 


Middle Scores Best a 
Splng. Stud. Vocab. Splng. | Stud. | Vocab. | Splng. 
48 23 49 42 93 le 65* 
63 66 49 57 76 63 58* © © 
42 34 49 46* 98 64 68* 
57 55 49 54* 33 65 45 
60» 47 49 51* 3 65 
R 
31* 82 49 60 59 65 | 55 
66 8 51 31 99 66 68* aN 
50 20 51 42 16 69 40 
25 61 51 55 75 72 58 
55 69 51 58 73 74 58 


vo Four Essay Torics 


: * Spelling scores within five standard score points (= 0.5 a) of corre- 
ding vocabulary scores. 


LE 3.—SPELLING STANDARD SCORES AND NUMBER OF WORDS IN Essays 
_ Not IN Horn’s List, OF SELECTED STUDENTS GROUPED ACCORDING 


Topic B Topic C Topic D 

re Stud. | LEE Jwvords | Stua. | SOE | words | Stud. | SSK | words 
28 5 | 28 10 7 30 5 9 | 31 1 
37 15 | 39 12 | 14 | 39 28 17 | 40 22 
40 28 | 43 11 | 35 | 46 24 38 | 46 9 
43 45 | 50 15 | 41 50 11 43 | 50 2 
43 63 55 14 | 57 | 55 12 | 61 55 17 
46 65 | 57 9 | 80 | 60 40 62 | 65 13 
48 71 58 40 | 83 | 60 15 70 | 58 8 
50 77 58 20 | 87 | 62 6 | 8 | 62 vt 
53 79 | 60 0 | 90 | 63 6 
58 81 60 10 | 96 | 66 27 
58 84 60 2 | 98 | 68 22 
60 
62 ja 
63 
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topics which together accounted for half the essays. Within each 
group, students are arranged according to spelling score, with the 
corresponding vocabulary score entered to show the number of 
words which are not in Horn’s list. Here again no close relationship 
appears. 

These results apply to the restricted situation investigated. 
Other poplations, however, and other ways of measuring spelling 
and vocabulary may show different patterns. Intelligence or read- 
ing or motivation may influence spelling and vocabulary in ways 
separate from any mutual relationship between these two factors. ` 

ial fields at the college level, like mathematics, English, or 
oreigh languages, may have particular characteristics. 


CONCLUSIONS 


1) College freshmen vary in spelling ability and written vocabu- 
lary. Spelling variation in the group tested was similar to that in 
the general population, although about a higher mean. Vocabulary 
varied, too, but there are no general population data for com- 
parison. 

2) Of the three hypotheses set up for testing, only the second is 
justified: vocabulary complexity shows no correlation with spelling 
ability. There is no evidence in this study that freshmen inhibit 
their written vocabulary because of fear of misspelling. 

8) Since there is no relationship between spelling and vocabulary, 
spelling improvement by itself should not be expected to lead to 
better or freer written vocabulary. 


SUMMARY 


Tn this study a spelling test screened out one hundred university 
freshmen who had not studied English composition or spelling be- 
yond high school, and whose knowledge of English was not affected 
by foreign extraction or unusual experience. Their spelling scores 
were distributed normally from high to low. Each of these fresh- 
men had written as part of the entrance examination in English & 
500-word essay, and had been warned that choice of words and 
clarity of expression were important and that mechanical errors 
like misspellings would be penalized. Scores were assigned to each 
essay to indicate the average frequency of appearance of the words 
used, based on Horn’s (2) list of 10,000 English words most com- 
monly used in writing. Each student’s spelling and vocabulary 
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s compared. Both Pearson r (= .075) and inspection of 
cases found only a chance relationship between spelling 
written vocabulary level. The methods used with this 
‘one hundred freshmen show that low spelling ability does 
ly hamper vocabulary choice when there is pressure to 


ar settings and with different instruments should Help to tes a 
melusion. 
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RORSCHACH CONFIGURATIONS ASSOCIATED 
WITH COLLEGE ACHIEVEMENT: 


CHARLES C. McARTHUR AND STANLEY KING 


— m 


A Department of Hygiene, Harvard University 


* Vorhaus (5) recently defined four test patterns that she found 
accounting for nearly three-quarters of the Rorschachs she had 
collected from children attending a remedial reading center. She 
able to explain the children’s ‘reading problems’ as symptoms 
f long-standing character difficulties revealed by the four test ’ 
syndromes. The four test patterns that Vorhaus described may be 
characterized as follows: 

Type I is the merely formal record, with no other determinant 
save the form of the blot being used more than twice. 

Type II is the animal-movement dominated record, with nothing 
else save form being much used. 

Type III is the human-moyement dominated record, with no 
accompanying development of other determinants. 

Type IV is dominated by inanimate movement and color re- 
sponses, with no other determinants being much utilized. 

We will interpret the psychological meanings of these patterns as 
they arise in our discussion. Detailed interpretations of them are 
suggested in Vorhaus’ article. 

Although in age her subjects ranged from six to twenty-one, and | 
although academic difficulties are not always a result of ‘reading 
problems’, we thought it worth while to test her patterns as prog- 
nosticators of academic difficulties in college. Past attempts tore 
late the Rorschach to college adjustment have met more failure 
than success. In the most hopeful study to date, Munroe (4), using 
group administration of the test, showed a relation between Ror- 
schach patterns and student success at Sarah Lawrence, but an 
attempt to apply her method at the University of Chicago 0) 
brought negative results. 


i 
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1 This experiment was supported in part by the Department of Hygiene, 
Harvard University, the Harvard Medical School Department of Psychi- 
atry and the Department of Social Relations; and in part by Air Force Con- 
tract A.F. 33 (038) 201-42, from the School of Aviation Medicine, Randolph 
Field, Texas. 
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PROCEDURE 


_ The writers set out to compare the frequency of Vorhaus types 
ong individual Rorschach protocols collected from two groups: 
one hundred and thirty-seven referrals during the years 1950- 
"1952 to Harvard’s Department of Hygiene, all of these students 
being in some sort of academic or personal difficulty, ang all tested 
by McArthur; and (2) seventy-four controls, who were being teste 
“as part of Bother experiment at Harvard Medical School. The 
latter were all Harvard College juniors or seniors in the classes of 
52 and ’53 and were selected at random from the class lists. Tk 

, were classmates of, or at least undergraduates contemporaneo 
with, the Hygiene referrals. They were tested by King. 

Our expectations were first that Vorhaus types I and II would 
be rare in our two samples of college students, inasmuch as they 
seem to lack the resources to carry them so far in their education. 
Secondly, we expected that type IV would be common, and would 
“occur more frequently in the Hygiene cases than in the controls. 
 Vorhaus’ description of this type sounds like a common variety of 
‘college problem-student, known already to the physicians at 
Harvard. As Vorhaus says: 

“The implication . . . is that the subject is responsive to affec- 

tive stimulation; indeed, he is responsive to a point where moments 
‘of strong feeling occur as often, or almost as often, as do those 
"when a more surface pleasantness is all that is evoked.... The 
subject may have succeeded in repressing recognition that thie is 
60, 
“We again glimpse... the ‘good’ home, the submissive child, 
‘and the awareness of pressure... . Since the resentment (of the 
ressure) cannot be overcome, the psychological need becomes 
that of preventing it from being experienced as associated with the 
environment. This is done by turning the hostility against the self, 
for the ‘guilt’ of harboring the resentment .... With this accom- 
lished, the subject is able to feel that “Mother and Father are 
tirely just in all their demands and expectations. It is I who am 
guilty for not codperating with them. It is because of my inade- 
‘qQuacy and inferiority that their ‘good’ plans have not worked out.” 
_ This unhappy state of mind is seen all too commonly in the men 
“Who come, or are sent, to the Department of Hygiene for help. 
ley are usually in academic trouble as a matter of administrative 
act or else are ‘failing’ in their own eyes because they have not 
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met their own and their parents’ level of aspiration. In this respect 
these Hygiene cases are probably quite similar to the reading clinic 
cases seen by Vorhaus. 

Her description of type IV did not seem to fit what was known 
of the men in the control group. These men had been extensively 
studied in an experiment on laboratory stress-inducing situations 
42) and oħ the whole had been judged to be stable, well-adjusted 

_ students. Therefore, we had grounds for expecting that type IV 
would be less prevalent among this randomly selected group than 
ng the Hygiene cases. 
ore making direct comparisons, it was necessary to be certain 
that our results were not contaminated by differences in scoring. * | 
To test inter-scorer reliability, McArthur rescored twenty of 
King’s protocols, selecting those in which slight changes in scoring 
would be most likely to result in changes of type. The result was 
that only one test needed to be reclassified. The question of the 
effect of the personality of the tester on Rorschach responses could 
be checked in only one case when both of us tested one subject; 
in that instance, the configuration was not altered. 


5 


RESULTS 


When we compared the Hygiene cases with their controls, the 
results were as predicted. Vorhaus types I, II, and III were com- 
paratively rare, and in no case numerous enough to differentiate 
significantly between controls and Hygiene cases. Type IV proto- 
cols were more frequent, differentiating the Hygiene group from 
the control group at the .01 confidence level. Furthermore, Table 
1 shows that this contrast was reliable over two years of Hygiene 
experience. 

Two circumstances in the data (which are presented in Table 1) 
raised questions that needed exploration. Only half of the Hygiene 
cases were included in the four Vorhaus types and only one-quarter 
of the control cases. While that difference was in the predicted 
direction, and was itself a satisfactory finding, it leaves fifty to 
seventy-five per cent of the Rorschachs unexplained. Then, too, 
the contrast between Hygiene and controls with respect to type 
III records was, though insignificant, in the wrong direction. What 
is happening here? 

An examination of the Vorhaus check list defining type III 
shows that its characteristics are: a) M (human movement) has a 
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ILE 1.—ReELATIVE FREQUENCIES OF RorscHacH Types IN HYGIENE 
REFERRALS AND CONTROLS 


Hygiene ’50-’51 | Hygiene ’51-’52 Controls Significance 


7 per cent 2 per cent 7 per cent none 
2 per cent 0 per cent 3 per cent none 
5 per cent 6 per cent | 14 per cent a none o a 
35 per cent | 39 per cent 2 per cent -01 
= 55 82 74 


umerical value of 3 or more. b) FM (animal movement) Lee: X 
exceed one and one-half times M. c) m (inanimate movement) does 

“not exceed one-half times M. d) the color sum (weighted total of 

_ responses to color) is less than 3. 

On casual inspection, one would not take this as a syndrome that 
would bode ill for academic performance or for reading skills. Tra- 
ditionally, the M response has been taken to be the intellectually 
‘best’ kind of response, and type III seems to be an M-dominated 
record. 

The rationale, of course, is that the type III person has great 
"assets, as shown by his movement responses, but that these assets 
not put to socialized use. Without adequate development of the 
‘color responses (as quantified by condition d), one suspects that 
_ the type III man has a rich but too exclusively inner mental life. _ 
Orhaus (5) feels that “the instinctual drives have been integrated 
to the individual’s total functioning, thus becoming the source 
creative energy and achievement,” but that there is “failure, 
oth in the capacity to establish rapport and in the ability to react 
to the environment deeply and genuinely.” The result, she sug- 
_ gests, is a “determination to preserve his creative drives as his 
' personal (and secret) treasure.” 
There are those who would say that such a person would be a 
‘natural’ student for Harvard. This is the born resident of the 
ivory tower, who should fit into a university that has been charac- 
terized by F. L. Wells (7) as ‘dominantly cerebrotonic’. Yet even 
‘eerebrotonic’ schools demand that students pass tests. Even in 
‘such institutions, some ‘capacity to establish rapport’ is a pre- 
requisite for getting grades. It should therefore be the type III 
People, with the addition of adequate color response, who make © 
Á ©] 5 is 
à 
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the best Harvard students. If the color response may be taken as 
indicating the ability to relate inner resources to outer environ- 
ment, such modified type III subjects should presumably display 
. Tesources that are available for socialized use and are not, like the - 
resources of pure type III people, retained as ‘private treasures’. 
We therefore may define a new type, IITA, as people who meet 
o othe first three criteria of Vorhaus type III but are not bound by 
type III’s fourth limitation. Type IITA people show: a) M with a 
value of 3 or more. b) FM less than one and one-half times M. c) 3 
pyless than one-half times M. d) color sum of 3 or more. 
Empirically, they turn out very well. They are found in our, 
4 groups at the following rates: ; 


Hygiene ’50-'51 | Hygiene ’51-’52 Controls Significance 


Type IIIA 7 per cent 7 per cent 33 per cent 01 


The difference is in the predicted direction and reaches a good level 
of confidence. Once again, the Hygiene figures look reliable. 

An even stricter definition of the ideal student’s Rorschach 
might demand that his color responses be developed in the direc- 
tion of form-dominated color responses rather than of the use of 
color alone or color dominating form as a determinant. That, on 
orthodox Rorschach theory (3), would indicate not only ability to 
relate oneself to the environment, but specific skill at tying one’s 
impulses in with the social environment. It is usually FC that is 
taken as the sign of ‘ability to establish rapport’. 

So we made a stricter definition of our ‘good student’ subtype, 
demanding, beside the criteria for type IITA, that FC (form-color 
responses) should numerically exceed CF (color-form) and C (pure 
color). For this narrower group (which, of course, is selected within 
the ranks of type IITA), we find these frequencies: 


Hygiene 5051 | Hygiene ’s1-’52 Controls Significance 


Type IIIB 0 per cent 1l per cent | 22 per cent .01 


Though we lose cases, the exceptions to our rule are almost elimi- 
nated as we sharpen our definition. In fact, the lone Hygiene 
record showing this type of M and FC dominated scores was made 
ə 
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by a boy who had been referred for purely vocational guidance 
reasons and not for the usual academic, personal and administrative 
troubles. 

We must hasten to point out that well-developed M and FC re- 
sponses are usually sufficient but never necessary conditions for 
academic success. Wells (6) has shown, in a recent stugy at Har- 
vard, that twenty per cent of the National Scholars here actually” 
had no human movement responses! 

From these facts, we may reach certain firm conclusions and 
suggest other likely hypotheses. y 

First, we have offered additional evidence for the validity o 
Vorhaus’ typology. Those types for whom she suggests minimal re- 
sources are not much seen at Harvard, while her type IV, the only 
one commonly admitted to this college, is found almost exclusively 
among people who were in academic difficulty. (We might add that 
type III, seen at Harvard with some frequency and not character- 
istic of Hygiene cases, was the type for which Vorhaus foresaw the 
best prognosis.) The fact that her scheme could be transferred 
from New York to Cambridge and from pre-college to college stu- 
dents suggests that it possesses more widespread validity than is 
always found in methods for identifying poor students with the 
Rorschach. 

Second, both Vorhaus’ types and our extensions of her typology 
to include some ‘desirable’ academic syndromes, are based on usual 
Rorschach theory. Both typologies have been empirically shown 
to be valid and reliable. We have therefore some inferential evi- 
dence that the Rorschach, interpreted in the customary manner, 
can lead to correct conclusions about college performance. The 
limitation seems to be that only some Rorschachs point directly to 
good or poor academic performance, while other Rorschach pat- 
terns occur infrequently or are ambiguous as to their significance. 

Third, the positive results shown here point out the need of 
studies in which the Rorschachs are administered as the student 
enters college. Ours is a study after the fact. We cannot establish 
whether the test patterns we observed were long-term character- 
istics of the men tested or whether the test they gave reflected their 
recent experience with years or months of college success or failure. 
The need is to administer the test to a sample of an entering class 
and immediately make sealed predictions, to be checked after the 


student has accumulated his good or poor academic record. Our 
o 
rs 
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present findings suggest that such a study would have positive and 
practically applicable results. 
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K ÀA COMPARISON OF THE STANFORD-BINET, 
REVISED FORM L, AND THE CALIFORNIA 
TEST OF MENTAL MATURITY (S-FORM): 


WILLIAM D. SHELDON 


{) 
Director of the Reading Laboratory, Syracuse University ec © 
and 
GEORGE MANOLAKES S 
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Ball State Teachers College, Muncie, Ind. z T 


An investigation was made of the characteristics of good and 
poor readers in grades I-XII in eight school systems in the State 
of New York. The present article presents the results of the Stan- 
ford-Binet Intelligence Scale, Revised Form L, and the California 
- Test of Mental Maturity (S-Form), administered to each child 

_ studied in grades I-VI. This study evolved from the need of com- 
paring the two tests in order to assess the intellectual status of the 
pupils tested in a more accurate manner. The Grace Arthur, Form 
II, was also administered to each of the pupils in the study and a 
comparison between this test and the Stanford-Binet has been 

published.” 


SELECTION OF SUBJECTS 


The subjects represent ten per cent of all the pupils in grades 
I-VI in the eight schools participating in the study. The popula- 
tion included 4220 pupils from fourteen elementary and four rural 
schools, Included in this study were four hundred and twenty-two 
pupils. They were selected by their teachers in each of the schools. 
In making their selections, the teachers used three criteria: (1) 
achievement tests in reading, administered before selection; (2) 


1 Authors wish to acknowledge the codperation of the Bureau of School 
Services, School of Education, and the Psychological Services Center of 
Syracuse University in making this research project possible. 

3 2G. Manolakes and W. D. Sheldon, “A comparison of the Grace Arthur, 
Revised Form II, and the Stanford-Binet, Revised Form L”, Journal of 
Educational and Psychological Measurement, 12, Spring 1952, pp. 105-108. 
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their own rating of each pupil’s status in reading; and (3) test 
scores derived from intelligence tests administered before the selec- 
tion. The teachers were asked to choose from their classes, five per 
cent of their pupils who were good readers and five per cent who 
were poor readers. The four hundred and twenty-two pupils tested 
ranged in ages from five years ten months to twelve years eleven 
months. è 

2 


PROCEDURE 


The Stanford-Binet and the California Test of Mental Maturity ° 


e administered to each of the four hundred and twenty-two 
pupils in the study by members of the research team. The con- * 
ditions under which the tests were administered were the best that 
could be obtained in the various schools. The Stanford-Binet test 
was administered individually by skilled clinicians while the Cali- 
fornia Test of Mental Maturity was administered to small groups 
by individuals trained in giving this instrument. The time interval 
between the two tests ranged from one day to four months, with an 
average interval of four weeks. 


RESULTS 


Table I reveals that there is no significant difference between 
the means of the California Test of Mental Maturity and the 
Binet. This difference was measured by both the ¢ test and by criti- 
cal ratio of the difference and its standard error. A coefficient of 
correlation of .702 was obtained from a comparison of the distribu- 
tions. The stability of this statistic was determined by Fisher’s r 
to Z transformation; and at the .01 level, this correlation coefficient 
would range .629 to .757. 

Table I also reveals that the Binet distribution is skewed signifi- 
cantly. This might be expected due to the manner in which this 
group was selected. However, the results of the California Test of 
Mental Maturity were not significantly skewed. The measure of 
kurtosis for both distributions cannot be considered significantly 
different from normal. 

According to Table II, 44.6 per cent of the scores obtained from 
the same individuals differed greater than 10 IQ points. Twenty- 
four and six-tenths per cent showed a difference greater than 15 
IQ points; while 13.7 per cent of the cases showed differences 
greater than 20 points. 
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TABLE 1—Frequency DISTRIBUTION AND CONSTANTS FOR CALIFORNIA 
Testr oF MenTAL MATURITY (S-Form) AND STANFORD-BINET, 
Revisep Form L 


Binet IQ CTMM 
6 160-169 0 
11 150-159 4 
11 140-149 10 
28 130-139 35 
61 120-129 6 
66 110-119 68 o 
86 100-109 91 
76 90-99 95 
59 80-89 33 
14 70-79 12 Y 
4 60-69 TALTE 
0 50-59 0 
0 40-49 1 SS 
422 M 422 
108.443 Mean 107.806 
925 Sm 819 
Smi-ma - 680 
CR -937 
iy 847 
106.244 Median 106.423 
18.984 S.D. 16.804 
653 Sa 578 
Serna +623 
CR 3,497 
P: 0002 
66-169 Range 47-156 
28.135 QoQ 25.881 
Skewness 
491 g .015 
119 Se «119 
00002 P 4433, 
Kurtosis 
0396 g: — .0136 
238 Sa -238 
4325 P t; .4761 
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Correlation 

R -702 

8; -0247 

z -867 

8s -049 

Z + 2.586, -741 to .993 
r range + -629 to .757 

K 


9 8 
TABLE II.—DIFFERENCES IN IQ Pornts BETWEEN THE STANFORD-BINET, 
Revisep Form L AND THE CALIFORNIA TEST OF MENTAL MATURITY 


P (S-Form) ` 
4 Binet Higher CTMM Higher 
E DES. EO TQ Difference 

A per cent N N per cent 
15.2 64 1-5 65 15.4 

T.t 47 6-10 49 11.6 

10.0 42 11-15 42 10.0 

4.7 20 16-20 26 6.2 

3.6 15 21-25 9 2.1 

1.7 7 26-30 10 2.4 

1.4 6 31-35 4 0.9 

0.7 3 36—40 1 0.2 

0.5 2 * 41-45 0 0.0 

0.2 1 46-50 0 0.0 

49,1 207 No difference 9 cases 206 48.8 

2.1 per cent 


The average algebraic and arithmetic differences between the 
test at the chronological age levels are indicated by Table III. The 
general trend of the average algebraic differences between the 
scores for individuals indicates that the California Test for Mental 
Maturity yields scores higher than the Binet at the six- and seven- 
year level, while the Binet was higher at ages nine, ten, and 
eleven. The scores were in greatest agreement at the eight-year 
level. The differences at ages twelve and thirteen are not considered 
as too indicative or significant because of the small number of 
students at these age levels, 

The average arithmetic differences between the IQ scores indi- 
cate an average absolute difference of 10.943 between the two 
tests. The greatest difference is in the ten-year-old group. 

In Table IV a significant trend is revealed by a comparison of 

a the results of the two tests by IQ groups. The average algebraic 
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Taprn III.—AVERAGE ALGEBRAIC AND ARITHMETIC DIFFERENCES BETWEEN 
THE STANFORD-BINET, Revisep Form L, AND THE CALIFORNIA TEST OF 
MENTAL MATURITY (S-Form) at THE CHRONOLOGICAL AGE LEVELS 


Age N Algebraic Difference* | Arithmetic Difference 
6 6l —4.197 @i.527 
ae 7 82 —1.890 9.415 © @ 

8 70 0.314 8.788 

9 71 3.606 12.535 

10 OT 3.839 14.446 

1 59 3.305 10.763 A 

12 18 —2.718 3.278 a 

13 5 3.000 9.000 Ne, 

Total 422 Average 0.573 10.943 


* Negative numbers indicate difference in favor of the California Test 
of Mental Maturity scores. 


differences indicate that at the upper Binet levels, the Binet scores 
are higher than those of the California Test of Mental Maturity. 
This difference decreases almost progressively until the average 
range is reached. At and below the average range, the California 
Test of Mental Maturity yields a higher IQ. 

Regression equations have been computed to demonstrate the 
disparity between the tests on different IQ levels. The tables 
further substantiate the differences already indicated. The re- 


Tasty IV.—Averace ÅLGEBRAIC DIFFERENCES BY IQ Grours 
DETERMINED BY Binet IQ 


N Algebraic Difference* 
160-169 6 33.833 
150-159 11 18.273 
140-149 il 13.909 
130-139 28 10.607 
120-139 61 2.787 
110-119 66 5.712 
100-109 86 —2,965 
90-99 76 —4,566 
80-89 59 —6,475 
70-79 14 —9,143 
60-69 4 —0.075 
* Negative numbers indicate difference in favor of the California Test © 
of Mental Maturity. o 
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gression formula for predicting Binet IQ’s from C.T.M.M. 1Q’s is: 
X= 7938Y + 22.953 = sy = 13.517 ` 

The regression formula for predicting C.T.M.M. 1Q’s from Binet 
1Q’s: Y! oe + 40.463 Yiz = 11.964. 
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